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Preface 


This book is a textbook for a semester-long or a year-long introduc- 
tory course in abstract algebra. 

There is a lot of information in that sentence, so let us unpack 
it. First of all, this is a book, not an encyclopedia. What is the 
difference? An encyclopedia is a massive collection of information, 
while a book has a theme. Our book certainly does, and that theme 
is number theory. To be clear, this is not a text book on number 
theory, but we have decided on which topics to cover with an eye 
towards number theory, and we have included several sections that 
show applications of general algebraic ideas to topics in number the- 
ory. At the same time, a theme has variations, so we have not strictly 
restricted ourselves but have covered other topics as well. 

This is an introduction, which means that we have presupposed 
no prior knowledge of abstract algebra. We do, however, assume that 
you (the student) have had a good course in linear algebra. By a 
good course we mean one that treats vector spaces and linear trans- 
formations in general, not one that is restricted to matrix manipula- 
tions (but of course does include that). And naturally, at this point 
in your mathematical development, you should be comfortable with 
doing rigorous mathematics, and this is certainly a rigorous book. We 
prove just about everything we claim or use, except that on occasion 
we mention a result that goes beyond the bounds of this book for 
the further edification of the reader. 

There is enough material here for a year-long course, but we real- 
ize that you (the instructor) may not have the luxury of spending a 
year on it, so we have tried to write this book in a modular way, so 
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that you may choose which topics to go into, and go into them as far 
as you like, before moving on to next one, covering what you wish 
in the course of a semester. Of course, some topics are required for 
others, so your choice will not be completely free. And for you (the 
student), if you are in a one-semester course, this book offers you the 
opportunity to read further in whatever particularly interests you. 

Naturally, having written this book, we think highly of it, and 
think that it would provide an excellent basis for further study in 
abstract algebra in general. But, given our emphasis, we think that 
it would provide an ideal basis for further study in algebraic number 
theory. 

The devil is in the details, as the saying goes, so here they are. 

We begin, in Chapter 1, with set theory. This is often skipped, 
or presupposed, but we have decided to begin with it for several 
reasons. First, you may not be familiar with this material. Second, we 
treat quotients in many places in the book, so we wanted to present 
a particularly careful discussion of equivalence relations. And third, 
we wanted to take the opportunity to present the Schroder—Bernstein 
theorem (with proof, of course), which you may not be likely to see 
elsewhere. 

In Chapter 2, we turn our attention to group theory. We con- 
centrate on finite groups, but begin by treating groups in general, 
with examples such as matrix groups, so you can see the widespread 
appearance of groups throughout mathematics. (Too often, in our 
opinion, groups are treated purely for their own sake, which is of 
course appropriate in a specialized text, but is an approach that 
leads the reader to think of them in isolation rather than being of 
general interest.) We treat the standard, and essential, topics: homo- 
morphisms, subgroups, quotient groups, etc. We also prove the fun- 
damental structure theorem for finite abelian groups, and for finitely 
generated abelian groups, something that is not always done in texts 
at this level. We then have a section on applications to number the- 
ory, where we prove Fermat’s little theorem and the basic facts on 
quadratic residues, all from a group-theoretic point of view. We study 
the actions of groups on sets, in preparation for proving Cauchy’s the- 
orem, results on the structure of p-groups, and the Sylow theorems. 
We briefly treat solvable groups, as we will be studying the solvability 
of equations by radicals in our chapter on field theory. We conclude 
this chapter by studying permutations and the symmetric groups. 
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Chapter 3 deals with ring theory. We begin in complete general- 
ity, considering both commutative and noncommutative rings, and 
rings with and without 1, and we study ideals, both one-sided and 
two-sided, ring homomorphisms, and quotients. But, in line with our 
emphasis, we fairly quickly turn our attention to commutative rings 
with 1, and further to integral domains. We study polynomial rings 
and prove the Hilbert basis theorem. We concentrate on the issues 
of divisibility and unique factorization in integral domains, proving 
the standard results that all Euclidean domains are principal ideal 
domains and that all principal ideal domains are unique factorization 
domains, developing Euclid’s algorithm in the process. Our approach 
highlights the role played by the greatest common divisor (GCD), 
and on our way to our main results we define GCD domains (inte- 
gral domains in which any set of elements, not all zero, has a gcd) 
and study their properties. We again have a section on applications 
to number theory, which has two main results. First, we use the 
fact that the Gaussian integers are a Euclidean domain (which we 
have earlier proved) to give Dedekind’s proof of Fermat’s theorem 
that every prime congruent to 1 modulo 4 is a sum of two squares 
in an essentially unique way. Second, we give Zolotarev’s proof of 
Gauss’s lemma and the Law of Quadratic Reciprocity by consider- 
ing signs of permutations. We give, and prove, examples of unique 
and non-unique factorization, including, in particular, a variety of 
examples of rings of algebraic integers in quadratic fields. We con- 
sider quotient fields and localization, and study polynomial rings in 
detail. We conclude by studying prime and maximal ideals, proving 
the standard result that maximal ideals are prime, though not in 
general conversely, and also the less standard result that an integral 
domain in which all nonzero prime ideals are maximal is a principal 
ideal domain if and only if it is a unique factorization domain. 

Chapter 4 deals with field theory in general and Galois theory in 
particular. We feel that our treatment here is quite distinctive. 

We begin in a very concrete way, first showing how to make com- 
putations in field extensions. We then “front-load” our treatment 
of Galois theory by giving many examples of field extensions and 
Galois groups, even before arriving at the fundamental theorem of 
Galois theory (FTGT). Then we turn to proving the FTGT. As a 
first step, we show that an extension is Galois if and only if it is 
normal and separable. We then prove the FTGT per se. While our 
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proof follows the spirit of Artin’s approach, it is different in detail, 
and we do not need to use Dedekind’s theory of group characters, as 
Artin does. (We think that our proof is thus more direct and concep- 
tually a bit simple.) Having provide the FTGT, we go on to study 
further examples of field extensions. We prove the theorem of the 
primitive element and give a quite extensive study, far more than is 
usually done, of primitive elements in field extensions. We determine 
the structure of finite fields, and of cyclotomic fields. We conclude 
by deriving important consequences of field and Galois theory. We 
prove Abel’s theorem that the general polynomial of degree five is 
not solvable by radicals. Indeed, we prove this for any degree d > 5, 
the proof of this more general result being identical to the proof for 
d = 5. For any prime p > 5 we give an explicit construction of a 
polynomial of degree p over the rationals for which this is the case. 
(Of course, this can be done for any degree at least five, but in light 
of this being an introductory text, we do not introduce the compli- 
cations necessary to do so for the general case.) We show that the 
classical problems of antiquity — trisecting the angle, doubling the 
cube, and squaring the circle — are impossible to solve by straight- 
edge and compass constructions. (Here we completely prove the first 
two of these but content ourselves with quoting Lindemann’s theo- 
rem that 7 is transcendental in proving the third.) Finally, we give a 
proof of the fundamental theorem of algebra. Despite its name, this 
theorem cannot have a purely algebraic proof, as the real and com- 
plex numbers cannot be constructed purely algebraically, but we give 
a proof that uses the irreducible minimum of analytic results — only 
the theorem that a polynomial of odd degree with real coefficients 
must have a real roots — but otherwise is entirely algebraic, using 
Galois theory and group theory. Throughout this chapter we heavily 
use the viewpoint that an extension E of a field F is an F-vector 
space, and so, as we have said, we are presupposing familiarity with 
vector spaces in general. 

We conclude in Chapter 5 by studying Dedekind rings. Logically 
speaking, this could be part of Chapter 3, but pedagogically speaking, 
we feel it would be a mistake to put it there, as at that point we would 
have no examples to work with. But, having developed field theory 
in Chapter 4, we have rings of integers in algebraic number fields as 
examples, and we first prove that these are always Dedekind rings. 
Then we prove the main result about Dedekind rings, that nonzero 
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ideals have unique factorization as a product of prime ideals, and then 
we give concrete examples chosen from rings of algebraic integers. 

There are two appendices. To get started in Chapter 2 with group 
theory, we need to know basic properties of the integers (e.g., primes 
and unique factorization). But these results are part of ring theory, 
which we do not treat until Chapter 3. So in Appendix A we simply 
state these results, in order to have them available at the start when 
we need them. Of course, we do prove them in Chapter 3, and indeed 
in a more general context. Our proof of the theorem of the primitive 
element in Chapter 4 uses a result from linear algebra that, although 
standard, is not always presented, so in Appendix B we provide the 
statement and a proof of this result in order to have it available as 
well. 

Each chapter concludes with a variety of exercises ranging from 
the straight-forward to the challenging. Some of these are particular 
examples that illustrate the theory while others are general results 
that develop the theory further. 

Finally, some remarks on numbering and notation: We use three- 
level numbering, so that, for example, Theorem 4.10.3 is the third 
numbered item in Chapter 4, Section 10. We denote the end of proofs 
by UO, as usual. Theorems, etc., are set in italics, so are demarcated 
by their typeface. Definitions, etc., are not, so we mark their end by 
©. Our mathematical notation is standard, though we want to point 
out that if A and B are sets, A C B means that A is a subset of B 
and A C B means that A is a proper subset of B. 

We have enjoyed writing this book and we trust that you will 
enjoy reading it, and thinking deeply about the matter within it, as 
well. 


Steven H. Weintraub 
Bethlehem, PA, USA 
May 2021 
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Chapter 1 


Set Theory 


Set theory is the language of modern mathematics. In fact, this 
language is so engrained, it is hard to imagine doing mathematics 
without using it. (Nevertheless, mathematicians did so for literally 
thousands of years.) 

We regard set theory as a tool and will be developing it with a 
view to using it in our investigations in the subsequent chapters of 
this book. But there are some quite interesting subtleties, especially 
when dealing with infinite sets, and we shall present some of these. 


1.1 Basic set theory 


We begin at the beginning, with the basic definitions of set theory. 


Definition 1.1.1. A set A is a collection of objects called its ele- 
ments. We write a € A to mean that the object a is an element of 
the set A. We write A = {a,b,c,...} to mean that a,b,c,... are the 
elements of A. ) 


We will often consider that our sets are contained in some “uni- 
verse” U, consisting of all possible objects (of whatever sort we are 
considering). 


Example 1.1.2. We have the empty set A = {}, the set having no 
elements. This set is often denoted by ¢. We also have the set A =U 
consisting of all possible elements. © 
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Of course, if these were the only sets we had, set theory would 
not be very interesting or useful. 


Example 1.1.3. We have A = {1,3}, the set whose two elements 
are the integers 1 and 3. © 


Instead of listing elements, we may specify a set by giving a property 
that its elements must satisfy. We let P(x) be a proposition (i.e., a 
true-false statement) involving the variable x. Then, {x | P(x)} is 
the set consisting of those x for which P(x) is true. (The vertical bar 
in this notation is read as “such that” .) 


Example 1.1.4. We have B = {z | x? — 4x + 3 = 0}. .) 


Next, we ask when two subsets are equal. If we regard a set as defined 
by its elements, the answer is forced on us. 


Definition 1.1.5. Two sets A and B are equal, i.e., A = B, if they 
have the same elements, ie., ife#e AS-ae B. © 


Example 1.1.6. The sets A and B of Examples 1.1.3 and 1.1.4 are 
equal. © 


Remark 1.1.7. What does it mean to solve an equation? The set B 
is simply the set of roots of the polynomial x? — 42 +3, so solving the 
equation means explicitly finding the set B of its roots. Mathemati- 
cians were solving equations for thousands of years before set theory 
came along. Indeed, Euclid knew how to solve this equation, while 
set theory was first developed in the late 19th century by Cantor. 
True, in this example, we don’t need to mention the word “set”, as 
we could simply ask to find the roots of this equation. But, in more 
complicated situations, the use of set theory is unavoidable. We want 
to reify (i.e., regard as an object) the roots of equations, for example, 
to be able to better handle and understand them, and the object we 
need to introduce is the set of their roots. © 


Definition 1.1.8. Let A be a set. Its complement A° is the set 
A°=f{xeu|a¢ A}. 


oO 


Example 1.1.9. Suppose / = {all integers}. If A = {a € U | 
x is divisible by 2}, so that A = {even integers}, then AS = 
{odd integers}. v) 
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It is convenient to introduce Venn diagrams, which provide a way 
of visualizing sets and their properties. We have the following Venn 
diagram of a set A, where the shaded area indicates the elements 
of A. 


Then, the shaded area in the following diagram indicates the 
elements of A°. 


Remark 1.1.10. We easily see that ¢° = U, Uo = ¢, and that for 
any set A, (A‘C)° = A. > 


Definition 1.1.11. Let A and B be sets. Then, B is contained in A, 
or isa subset of A, ifx € B= ax € A. We denote this by B C A. Also, 
B is properly contained in A, or is a proper subset of A, if BCA 
and B # A. We denote this by B C A. % 


Remark 1.1.12. Many people write, as we do, B C A to mean that 
Bisasubset of A. But many people denote this by B C A. We prefer 
our notation, as containment /proper containment is analogous to the 
distinction between x < y and x < y. © 


The following is the Venn diagram for this situation. 


4 An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


u 
A 


Lemma 1.1.13. Let A and B be sets. Then, B C A if and only if 
A° CB’. 


Proof. BC A means rt € B > « € A. But this implication is 
logically equivalent to its contrapositive, which is x ¢ A> x ¢ B, 
which means A®° C B*. 


Lemma 1.1.14. Let A and B be sets. Then, A= B if and only if 


(i) AC Band BCA or 
(ii) AC B and A° C BS. 


Proof. If A= B, then certainly both (i) and (ii) are true. 

To show the converse, first note, by Lemma 1.1.13, that conditions 
(i) and (ii) are equivalent. Suppose (i) is true. Since AC B,x € A> 
xé€B.Sine BC A,wxe BS>axeA. Thus,x¢ AS-ze B, and so 
A=B. 


Remark 1.1.15. This innocent looking lemma is the key to proving 
that two sets are equal. That is, if we want to show that A = B, the 
usual way of doing so is to show that A C B and that B C A, or to 
show that A C B and that AS C B*. © 


Now, we come to a pair of operations on sets. 


Definition 1.1.16. (a) Let A and B be sets. Their union AU B is 
the set 


AUB={x|xeEAorzre Bh. 
(b) Let A and B be sets. Their intersection AN B is the set 
ANB={xz|r%eEAand ze B}. ¢) 
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Example 1.1.17. If A = {1,2} and B = {2,3}, then AUB = 
{1,2,3} and AN B= {2}. .) 

The following is the Venn diagram for this situation, where the 
union is the hatched region and the intersection is the doubly hatched 
region. 


u 


Here are some basic properties of these operations. 
Lemma 1.1.18. 


(a) For any set A, AU A° =U and ANAS =¢. 
(b) For any two sets A and B, AUB =A if and only if BC A and 
AN B=A if and only if AC B. 


Proof. Exercise. 


Here are two more properties, which state that each of the oper- 
ations of union and intersection distributes over the other. 


Lemma 1.1.19. Let A,B, and C be sets. Then, 


(a) AU(BNC) =(AUB)N(AUC) and 
(b) AN(BUC) =(ANB)U(ANC). 


Proof. Exercise. 


Here are two more properties, known as De Morgan’s Laws. 
Lemma 1.1.20. Let A and B be sets. Then, 


(a) (AU B)* = ACN BS and 
(b) (AN B)& = ASUBS. 


Proof. Exercise. 
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We introduce a few more definitions. 


Definition 1.1.21. Two sets A and B are disjoint if AN B = @. If 
A and B are disjoint, we may write AU B as AI B and call this set 
the disjoint union of A and B. © 


Definition 1.1.22. If B is a subset of A, then A — B is the set 
A-B={rEA|c¢¢ B}=ANB. rv) 


1.2 Relations, functions, and equivalence relations 


We now want to introduce the general notion of a relation and then 
specialize this notion in two different directions, that of a function 
and that of an equivalence relation. 

Intuitively, what should we mean by a relation ~ between two sets 
X and Y? It is a decision: Given an element x € X and an element 
y € Y, we decide whether x and y are related, and if so, we write x 
~ y. For example, we might let X = {airlines} and Y = {airports}, 
and for an airline xp (ie., an element of X) and an airport yo (ie., 
an element of Y), decide that xo ~ yo if airline xo serves airport yo. 
But of course a “decision” is not a mathematical construct, and so 
we must formalize this notion. 

First, we need the following construction. 


Definition 1.2.1. Let X and Y be sets. Their product X x Y is the 
set 


XxY={(z,y)|re X,y € Y}. 0 
Example 1.2.2. Let X = {1,2} and Y = {a,b,c}. Then, 
Xx Y = {(1,a), (2,4), (1,6), (2,6), (1, €), (2,¢)}- > 


Now, we define a relation. 


Definition 1.2.3. A relation R between X and Y is a subset of 
XxY.Then,r~ pe yif (z,y) € R. » 


Remark 1.2.4. Strictly speaking, R is the relation. But intuitively 
speaking, we start out by deciding when x ~z y, and let R be the 
set of ordered pairs (x,y) for which this is true. Thus, we often call 
R the graph of the relation. © 
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Definition 1.2.5. Let R be a relation between X and Y. 
For zp € X, R(x0) = {y eY | (xo, y) = R}. 
For yo € Y, R7'(yo) = {x € X | (x, yo) € Ry}. © 


We thus see that R(zo) = {y € Y | zo ~ g y} and R“1(yo) = {x € 
X|x~ R yo}, ie., that R(xo) consists of those elements of Y that 
xo is related to and that R~!(yo) consists of those elements of X that 
are related to yo. 

We may construct new relations from old. 


Definition 1.2.6. Let R be a relation between X and Y. Then, its 
inverse R7' is the relation between Y and X given by 


R-1={(y,2) €Y x X| (x,y) € R}. © 


Definition 1.2.7. Let R be a relation between X and Y, and let S 
be a relation between Y and Z. Then, their composition SR is the 
relation between X and Z given by 


SR={(z,z)| for some y € Y, (#,y) € Rand (y,z) € S}. 0 


Actually, it is not so often in mathematics that we need to consider 
relations between X and Y in general. It is much more often the case 
that we want to consider functions from X and Y. 

What is a function? Intuitively, a function f : X — Y is a rule 
that assigns an element of Y to each element of X. If this rule assigns 
yo to xp, we write yo = f(x). Again, we must formalize this notion. 


Definition 1.2.8. A function (or mapping) f : X > Y is a relation 
between X and Y with the property that for every x € X, there is 
exactly one y € Y with (x,y) € f. In this case, we write y = f(x). 


Remark 1.2.9. The situation here is entirely analogous to (in fact, 
a special case of) Remark 1.2.4. Strictly speaking, this subset f of 
X x Y is the function. But intuitively speaking, we start out with 
the rule y = f(x) and let f be the set of ordered pairs for which this 
is true. Thus, we often call this set the graph of the function. 

To see why we use this name, let us look at a very familiar case. 
Let X = Y = {real numbers}, and let f(x) = x?. We identify X x Y 
with the plane in the usual way, and then the set {(x,«7) | « €¢ X} 
is just the graph of this function in the usual sense. 
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Example 1.2.10. Here is a (very simple but) very important func- 
tion. Let X be any set. Then, the :dentity function idx on X is the 
function defined by idx(x) = x for every x € X. (We often denote 
the function simply by id when X is understood.) © 


We have some special types of functions. 
Definition 1.2.11. Let f : X — Y bea function. Then: 


(a) f is one-to-one (1 — 1), or is an injection, if for every y € Y, 
there is at most one x € X with f(x) = y (or, equivalently, 
whenever 11 # 22 © X, f(r1) # f(a2) € Y). 

(b) f is onto, or is a surjection, if for every y € Y, there is at least 
one x € X with f(x) =y. 

(c) f is one-to-one and onto, or is a bijection, if f is both one-to-one 
and onto, i.e., if for every y € Y, there is exactly one x € X with 


f(z) =y. y 


It is illuminating to view f : X — Y as a collection of arrows, 
with an arrow going from x € X to f(x) EY. 


7. 


i 


Definition 1.2.12. Let f = X >~ Y andg=Y — Z be functions. 
Their composition h = gf is the function h = X — Z given by 


h(x) = g(f(@)). 0 


In the “arrow” representation, we have as follows: 


f 8 


> 


Thus, in this representation, h is the double-length arrows. 
Here is the basic fact about composition, which is used through- 
out mathematics. 


Theorem 1.2.13. Composition of functions is associative, t.e., if 
[:X 9Y,9g:Y ~Z, andh: Z > W are functions, then 


A(gf) = (hg) f :X > W. 
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Proof. On the one hand, for any « € X, 


(h(gf))(@) = h(gf(@)) = h(g(F(@))), 


and on the other hand, 
(hg) f)(@) = hg(F(x)) = A@(F())), 


and these are equal. 


In the “arrow” representation, we have 


and h(gf) = (hg)f is given by the triple-length arrows. 

Definition 1.2.14. Let f : X — Y bea function. Then, f is invert- 
ible if there is a function g: Y > X with gf = idx and fg = idy. 
In this case, we call g the inverse of f and write g = f~!. © 


Remark 1.2.15. We observe that this definition is symmetric in 
f and g, so if g = f-}, then f = g (and (f-!)"! = f, 
(g°*)"* =9). > 

There is a very simple criterion for deciding when a function is 
invertible. 


Theorem 1.2.16. Let f: X — Y bea function. Then, f is invertible 
if and only if f is 1 — 1 and onto. 


Proof. First, suppose f is 1 — 1 and onto. Then, for every y € Y, 
there is a unique element x € X with f(x) = y. Thus, setting 
gly) = @ if f(z) = y, 

we have a well-defined function, and we see that gf (a) = x, for every 
x eX and fg(y) = y, for every y€ Y,ie., gf = idx and fg = idy, 
so f is invertible and f~! = g. 

On the other hand, suppose that f is invertible and let g = f~!. 
First, we show f is onto: Let y € Y be arbitrary. Set x = g(y). Then, 


y = idy(y) = fay) = flo(y)) = F(@). 
Next, we show f is 1 — 1: Let y € Y be arbitrary and suppose 

y = f (x1) = f(x2). Then, 
gf (x1) = gf (2), idx (a1) = idx (x2), v1 = 22. 
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In the “arrow” representation, in case f is invertible, f~! is 
obtained from f by reversing the direction of the arrows. 


tg 


ae 


fol 
<—_—____ 


Now, we shift gears and look at a different set of issues about 
relations. 


Definition 1.2.17. Let R be a relation on X (i.e., between X and 
X). Then: 


(i) Ris reflexive if (x,x) € R for every x € X. 
(ii) R is symmetric if (x,y) € R implies (y, x) € R. 
(iii) R is transitive if (x,y) € R and (y, z) € R implies (z,z) E R. > 


Remark 1.2.18. This is the formal definition. But we usually think 
of it informally: 


(i) R is reflexive if x ~p a for every x € X. 
(ii) R is symmetric if whenever x ~p y, then y ~p @. 
(iii) R is transitive if whenever 7 ~p y and y~p z, then xz ~Rz. 9 


We can ask the following three yes/no questions: Is R reflexive? Is 
R symmetric? Is R transitive? A priori, there are eight possibilities 
for the answers, and in fact they all occur. (See Exercises.) But the 
case when all these answers are yes is a particularly important one, 
which we now focus on. 


Definition 1.2.19. Let R be a relation on a set X that is reflex- 
ive, symmetric, and transitive. Then, R is called an equivalence 
relation. v) 


Example 1.2.20. The archetype of an equivalence relation is equal- 
ity, ie., © ~p y if « = y. This is where the name comes from. But 
there are many others. © 
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Example 1.2.21. Consider the relation on people: Person 7 ~ 
Person y if « and y have the same birthday. This is an equivalence 
relation. v) 


To state the basic property of an equivalence relation, we need to 
make two preliminary definitions. 


Definition 1.2.22. Let ~ be an equivalence relation on X. For 29 € 
X, the equivalence class of x is [xo] = {y € X | zo ~ y}. Any element 
of an equivalence class is called a representative of that equivalence 
class. 0) 


Lemma 1.2.23. Let ~ be an equivalence relation on X, and let A 
be a subset of X. Then, A is an equivalence class of ~ if and only if 


(i) A is nonempty, 
(ii) c~y for every x,y € A, and 
(iii) « 4 z for everyx Ee A, z EA. 


Proof. Suppose A is an equivalence class. Then, by definition, it is 
[xo] for some 2g € X. Since ~ is reflexive, ro ~ Xo, and so 2 € [x0] 
and A is nonempty. Now, let x,y € A. By definition, 79 ~ x and 
ro ~ y. Since ~ is symmetric, x ~ ro and then, since ~ is transitive, 
x~ y. Also, let x € A and z ¢ A. Suppose x ~ z. Since zp ~ x, we 
have xg ~ z and so z € A, a contradiction. 

On the other hand, let A be a set satisfying (i), (ii), and (iii). 
Since A is nonempty, there is some xp € A. We claim A = [xo]. First, 
we show A C [zo]: Let y € A. Then, xo ~ y, so y € [xo]. Next, we 
show [xo] C A: Let z € [xo]. If z ¢ A, then zo ~ z with z ¢ A, a 
contradiction. Thus, A = [xo]. 


Corollary 1.2.24. Let ~ be an equivalence relation on X, and let 
A and B be equivalence classes of ~. Then, A and B are either 
identical or disjoint. Equivalently, let A and B be equivalence classes 
of ~ and suppose there is some x9 € X with rq € A anda € B. 
Then, A= B. 


Proof. If A and B are disjoint, there is nothing to prove. Suppose 
not. Let 79 € ANB. Let ye A andy’ € B. 

Then, to ~ y (as x,y € A) and ay ~ yy’ (as 20, y' € B), soy~y/. 
But then, by Lemma 1.2.23, y/ € A, so B C A, and similarly y € B, 
so AC B. Hence, A = B. 
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Definition 1.2.25. Let X bea set, and let A), Ao,... be nonempty 

subsets of X. Suppose that these subsets are pairwise disjoint, i.e., 

that A;M A; = ¢ whenever i # j, and that X = U Aj. Then, {Aj, 
t 


Ao,...} partitions X. In this case, we write X = I Aj. © 
1 


Theorem 1.2.26. Let X be a set, and let {A,, Ag,...} be a partition 
of X. Then, x ~ y if x € A; and y © A; for some i is an equivalence 
relation on X. 

Conversely, let ~ be an equivalence relation on X. Then {distinct 
equivalence classes of ~} is a partition of X. 


Proof. Clearly, c ~ y if x,y € A; for some i is an equivalence 
relation on X. 

Conversely, let A,, Ag,... be the distinct equivalence classes of 
~. As we have seen, each equivalence class is nonempty. Since for any 
xe X, x € |az], we certainly have X = ae A;. But then X = UAi 


as in restricting the union to the distinct equivalence classes, we are 

merely eliminating duplications. But by Corollary 1.2.24, the distinct 

equivalence classes are disjoint, so X = LI Aj. 
7 


Definition 1.2.27. Let {Aji, Ag,...} be the set of distinct equiva- 
lence classes of ~ on X. A set {a1,a@2,...} of elements of X with 
a; € Aj, for each 2, is called a complete set of representatives of the 
equivalence classes. .) 


Example 1.2.28. Let X = {people}, and let ~ be the equivalence 
relation on X of having the same birthday. Then, the set of dis- 
tinct equivalence classes is {{people whose birthday is January 1}, 
{people whose birthday is January 2}, {people whose birthday is 
January 3}, ...}. If Alice’s birthday is January 1, Bob’s birthday is 
January 2, Charlotte’s birthday is January 3,..., then {Alice, Bob, 
Charlotte, ...} is a complete set of representatives of ~. But also, 
if Arthur’s birthday is January 1, Brenda’s birthday is January 2, 
Charlie’s birthday is January 3, ..., then {Arthur, Brenda, Charlie, 
... } is also a complete set of representatives. The point is that rep- 
resentatives are almost never unique, and that there is no a priori 
reason to prefer one representative to another. © 
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The corresponding Venn diagram (enhanced with representatives) 
is as follows. 


e@a> xX 


Ay AwNYA3 


a3 
ea 


1.3 Cardinality 


In this section, we want to investigate the cardinality, or “size”, of 
sets. We start off with a very simple case. 

Let S be a finite set, i.e., a set consisting of a finite number of 
elements. Then, the cardinality of S', # (5), is the number of elements 
of S. In other words, if S has a finite number k of elements, then # 
(S) = k. For example, # (¢) = 0, # ({banana}) = 1, # ({apple, 
orange}) = 2, # ({tomato, cucumber}) = 2. 

We are now going to make this simple situation a whole lot more 
complicated. We do this because we have to develop a framework 
that we can use to investigate the cardinality of arbitrary sets. 

We let IN = {positive integers}. We use this symbol as the positive 
integers are sometimes called the natural numbers. 

For a nonnegative integer k, we let Ny, = {1 © N|i< k}. Thus, 
INo =; and for k > 0, Ng ={1 2... «23 h}- 

In this section, we call a bijection a 1 — 1 correspondence. 

A moment’s thought shows that we can recast our simple notion 
of size above in the following form. 


Definition 1.3.1. Let S bea set. Then, S' is a finite set if there is a 
1 — 1 correspondence f: IN, — S for some nonnegative integer k. In 
this case, we say the cardinality of S, # (S') =k. % 


Remark 1.3.2. Strictly speaking, in order to know that this defi- 
nition makes sense, we must know that if 7 and k are nonnegative 


14 An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


integers with j # k, then there is no bijection between IN; and NN,. 
We leave this as an exercise. © 


Thus, we see, for example, that # ({apple, orange}) = 2 because 
we have a 1 — 1 correspondence f: Nz = {1, 2} — {apple, orange} 
given by f(1) = apple, f(2) = orange. Similarly, # ({tomato, cucum- 
ber}) = 2 because we have a 1 — 1 correspondence g: Nz — {tomato, 
cucumber} given by g(1) = tomato, g(2) = cucumber. 

Now, these two sets have the same cardinality (namely 2), and we 
have a 1 — 1 correspondence h: {apple, orange} > {tomato, cucum- 
ber} given by h = gf~!. Then, h(apple) = tomato and h(orange) = 
cucumber. But now, we can eliminate the middleman, and we are led 
to the following definition. 


Definition 1.3.3. Let S and 7 be sets. Then, S and T have the 
same cardinality, # (S) = # (T), if there is a 1 — 1 correspondence 
A:S—> T. v) 


Lemma 1.3.4. Having the same cardinality is an equivalence rela- 
tion on sets. 


Proof. Exercise. 


Definition 1.3.5. A set S is countably infinite, or simply countable, 
if there is a 1 — 1 correspondence f: IN > S. © 


Remark 1.3.6. We use the term because in this situation, we can 
simply count the elements of S : f(1) is the first element of S, f (2) 
is the second element of S, f(3) is the third element of S,.... .) 


Example 1.3.7. Let a,,a2,a3,... be any infinite sequence all of 
whose elements are distinct. Then, the set A = {a1,a2,a3,...} is 
countable as we have a 1 — 1 correspondence f: IN > A given by 


fi) = a. 0 


Remark 1.3.8. We use this example in the opposite direction. If 
we have an infinite set and we can list its elements in order, then 
we have a 1 — 1 correspondence between that set and IN, so that the 
set is countable. © 


We think of the cardinality of a set S as its size, so to say that 
# (S) = # (T) is to say that S and T have the same size. But this 
definition, which is forced on us, has some surprising consequences. 
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Example 1.3.9. (a) The set A = {even positive integers} is count- 
able, as we have the 1 — 1 correspondence f: IN — A given by 
f(n) = 2n. Thus, even though we might think that IN is twice as 
big as A, in fact they have the same size. 

In light of Remark 1.3.8, we could have shown that A was count- 
able simply by listing the elements of A in order, A = {2,4,6,8,... }, 
rather than writing down the function f, and that is what we will 
do in the remaining parts of this example. 

(b) The set A = {nonnegative integers} is countable as we may 
list the elements of A as A = {0,1,2,3,... }. 

(c) The set Z = {all integers} is countable as we may list the 
elements of Z as {0, 1, —1, 2, —2, 3, —3, ...}. 

(d) Consider the set IN x IN = {(2,7) | i € IN, 7 € N}. This set 
contains infinitely many copies of IN, the subsets {(7, 70) | i € IN} for 
any fixed 7 € IN, so we might suppose its cardinality is greater than 
that of IN. But this set, too, is countable. In the following figure, we 
let (2,7) denote the point with those coordinates in the plane, and 
next to each such point, we write the number of that point in a listing 
of IN x NN. 


1 
5- > 
10 14 
4- e e 
ra 0 2S 13 
7 Pe oo m) 12 
1 1 2 4 7 11 
4 e e e e 
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To clarify, this gives the listing of IN x IN as IN x WN = {(1,1), 
(2, 1), (1, 2), (3, 1), (2, 2), (1, 3), ...}, amd we see that IN x 
IN is countable. v) 


At this point, you may wonder if there are any sets that are 
uncountable (i.e., not countable). The answer is yes. 


Theorem 1.3.10. (Cantor) R = {real numbers} is uncountable. 


Proof. We must show that there is no 1 — 1 correspondence from 
IN to R, or equivalently, that no matter how we list the elements of 
R in order, our list will not contain all of them. 

We can regard real numbers as given by their decimal expansions, 
and the decimal expansion of a real number is unique, except for one 
ambiguity: a decimal expansion that ends in an infinite sequence of 
9’s. (For example, .999...= 1.) So, we never use such an expansion. 

Thus, suppose we list the real numbers: 


1.1 hk 1 1 1 1 1 
Ty, = €1...€3 Ag A, Ag*A_; A_g G_3 a_4, 


_ 2 42 a2 72). we 2 2 2 
T2 = €2...43 a5 ay ag: a 1 a 2 a 3 a 4 


3° po: Ad (Ao:, Ao 3 3 3 
T3 = €3...43 a5 ay ag: a ‘il a 9 a 3 a 4> 


= 4 4 4 4 4 4 4 4 
T4 = €4...43 a5 ay a9°4_1 Ag 4_3 G_y, 


where ¢; is a sign (€; = +1) and the ai, are decimal digits. 

Form the real number s = .b_1b_9b_3b_4..., where b_; # Chae 
and b_; £ 9. (The restriction b_; 4 9 is to guarantee we never end s 
with 999...) 

Then, s is not an element of the list {r1,r2,7r3,...} as s differs 


from r; in the ith digit following the decimal point, for every 7. 


Remark 1.3.11. The argument in this proof is known as Cantor’s 
diagonal argument. © 


We can apply a similar idea in more general circumstances. First, 
we need a general definition. 


Definition 1.3.12. Let S and T be sets. Then, # (S) < # (T) if 
there is a 1 — 1 correspondence from S to a subset of T’, and # (5) < 


Set Theory Li 


# (T) if # (S) < # (T) and # (S) # # (7), ie., if there is a 1 — 
1 correspondence from S to a subset of 7’, but there is no 1 — 1 
correspondence from S' to T. © 


Theorem 1.3.13. Let S be any set, and let P(S') be the power set 
of S, 
P(S) = {all subsets of S}. 


Then, # (S) < # (P¢S)). 


Proof. If S is a finite set, this is easy: If # (S) = k, then 
# (P(S)) = 2*, and k < 2* for any nonnegative integer k. 

Let S be an arbitrary set. First, we have a 1 — 1 correspondence 
e from S to a subset of P(S) given by e(s) = {s} for s € S. 

To show that there is no 1 — 1 correspondence f : S + P(S), we 
argue by contradiction. Suppose we have such a correspondence. Let 


R={seS|s¢€ f(s)}. 


R is a subset of S, and since a 1 — 1 correspondence is onto, we 

must have R = f(so) for some spo € S. We ask whether so € R. 
If so € R, then by the defining property of R, so ¢ f(so) = R. 
If so € R= f(so), then by the defining property of R, so € R. 
Thus, in either case, we have a contradiction. 


Example 1.3.14. Let us now consider Q; = {positive rational num- 
bers}. Note that we can write every element r of Q; as r = p/q, a 
quotient of positive integers, with this fraction in lowest terms. We 
claim that Q+ is countable. Let us try to show this. 

Before doing so, let us observe that certainly # (IN) < # (Q,) as 
we have the 1 — 1 function f: IN > Q4 given by f(n) = n/1. 

But let us try to come up with a 1 — 1 correspondence from Q, to 
NN. To do so, let us return to Example 1.3.9 (d), where we considered 
IN x NN. Let us consider Q;, where we identify r = p/q with the 
point with coordinates (p,q). Note that we do not get all points of 
IN x N. For example, we do not get the point (2,2) as 2/2 is not a 
fraction in lowest terms. Thus, in that example, let us simply erase 
the points of IN x IN that we don’t get and keep the labels on the 
points that we do. We have the following figure. 
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5- eh 

10 
4- e 
34 ° - 
2-4 re °° 

1 2 4 7 ll 
1- e e e e 

T T T T 1 

1 2 3 4 5 


Now, taking a lattice point to its label, we have a 1 — 1 function 
g from Q+ to N: g(1,1) = 1, g(2,1) = 2, 9,2) = 3, g(3,1) = 
4, g(1,3) =6,... 

Note that this function g is not onto IN as the set of its values is 
(1,9. 5,46; 7,8,9,10, 1 Tecan P 

Thus, we have succeeded in showing that # (Q,) < # (IN), but 
we have not shown that # (Q,) = # (NN). 


Now, perhaps we could be more clever and construct a 1 — 1 
correspondence between IN and Q;. But this would not help us very 
much, except in this one example. A theory of cardinalities that 
required us to be clever each time we wanted to show # (S) = # (T) 
would not be a very useful theory. So, it is very useful to have the 
following result. (You can’t get something for nothing, so, as you 
will see, this theorem that avoids the necessity of being clever in 
particular examples itself has a very clever proof.) 


Theorem 1.3.15 (Schréder—Bernstein). Let S and T be sets 
with # (S) < # (T) and # (T) < # (S). Then, # (8) = # (T). 
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Proof (K6nig). We first prove this in the case where S and T are 
disjoint. This involves all the hard work (and cleverness). Once we 
have done that, we will see how to easily extend the proof to handle 
S and T in general. 

Thus, let us suppose that S and T are disjoint sets and that we 
have a 1 — 1 correspondence f between S and a subset of J, and a 
1 — 1 correspondence g between T and a subset of S. 

For any element so of S, we may form the “string” 


which is defined by to = f (so), $1 = g(to), t1 = f (81), 52 = g(t), t2 = 
F(s2),.... 

Note that this string extends infinitely far to the right, as we may 
always keep applying f or g as the case may be. 

We now try to extend this string to the left. Consider so. Now, g 
is a 1 — 1 function but is not necessarily onto. 

If so # g(t) for any t € T, then we cannot extend the string 
past so to the left. But if s9 = g(t) for some t € T, there is exactly 
one such t, which we call t_;. Thus, s9 = g(t_1) or t-1 = g~1(s0). 
Now, consider t_,;. By the same logic, either there is no s € S with 
f(s) = t_1 or there is exactly one such s, which we call s_;. Then, 
tar = f(s_-1) or S_,j = fC 4). 

We keep going with this procedure as long as we can. Perhaps 
this goes on forever, or perhaps not. So, our string becomes 


=i -1 
a < me aes SO tt > S] >ty PREG og 


where we do not know if, or where, it stops on the left. 
Note that we can rewrite this string as 


Note also that we have simply labeled the arrows for emphasis. 
A right-pointing arrow must have a label f (resp. g) if it leads 
away from an element of S (resp. T'), and a left-pointing arrow must 
have a label g~! (resp. f~!) if it leads away from an element of S 
(resp. T). 

We claim that {distinct strings} is a partition of SUT. 
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To show this, we must show (i) that every element of SU T is in 
some string and (ii) that distinct strings are disjoint. To show (i), we 
first show that every element of S is in some string and then show 
that every element of T is in some string. 

Let s € S. Then, we have the string obtained by choosing sg = s, 
and that string contains s in position sp. Let t € T. Set s = g(t). 
Then, we have the string obtained by choosing sg = s, and that string 
contains t in position t_;. This shows (i). Now, we observe that any 
string is entirely determined by any element of it. Certainly, it is 
determined by sg, as from the first way of writing the string, we see 
that any element of the string is at the head of a sequence of arrows 
emanating from sg. But now, note that we can turn the direction 
of arrows around, as we did to get to the second way of writing the 
string, so by the same logic the string is entirely determined by any 
s; or t;. From this, we see that any two strings that have an element 
in common must be identical. (We need not be concerned about the 
position of that element in each of the strings, as a string always 
goes infinitely far to the right and always goes as far to the left as 
possible.) This shows (ii). 

Now, let us consider {distinct strings}. We will see that any string 
is of one of the four types. 

How can a string behave? Here is one possibility: 


Type I: The string goes infinitely far to the left (and of course to 
the right) with no repetition of elements. 

Here is the next possibility: The string has a repeated element. 
Choose a pair of repeated elements that are as close together as 
possible, say s;1,% = 5;. (As we will see momentarily, the argument is 
the same if some t element is repeated.) But then, tii, = f(si+%) = 
f(si) = ti, and then si4441 = g(titeh) = g(ti) = Si41, etc., So we 
have a repetition of both s and t elements to the right. But also, 
8; = S44 = g(ti4~—1), SO We can continue the string to the left of s; 
with t;-1 = tj+,4_1 (as tj_1 is defined by the equation g(t;_1) = s;). 
But then, by the same logic, s;-1 = $;4,~1, So we have a repetition 
of both s and t elements to the left. This gives the second possibility: 


Type II: For some k, the string contains the substring 


So > to... Sky —> try 
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with no repetition, but g(tx,-1) = so, and the string goes infinitely 

far to the left (and of course to the right) repeating this substring. 
This takes care of the cases in which the string extends infinitely 

far to the left. Thus, what remains are the cases in which it does not. 
This gives the remaining two types: 


Type III: For some N, a string that begins on the left with s_j 
(and of course continues infinitely far to the right), i.e., a string of 
the form 


gp gp ee as ee a ar Stee 


Type IV: For some M, a string that begins on the left with t_jy_, 
(and continues infinitely far to the right), i.e., a string of the form 


= = = 
jaya ar! t_ué Sumy = 


Now, having done all this hard work, we can now easily define 
a 1 — 1 correspondence h : S — T. The correspondence is given as 
follows: 


If s is an element of a type-I string, h(s) = f(s). 

If s = so,...,5,-1 in a type-II string, h(s) = f(s). 

If s is an plement of a type-II] string, h(s) = i. a 

If s is an element of a type-IV string, h(s) = g~'(s). 


Then, we are done! 

It remains to handle the case where S and T are not disjoint. Let 
So = Sx {0} = {(s,0) |s € Shand T; = Tx {1} = {(¢,1) | t € Th. 

Then, we have a 1 — 1 correspondence i : S + So given by i(s) = 
(s,0) and a 1 — 1 correspondence j : T — T; given by j(t) = (¢t,1). 
Note that So and T) are disjoint. 

Suppose we have a 1 — 1 correspondence f between S' and a subset 
of T. This gives us a 1 — 1 correspondence f between Sp and a subset 
of T; defined by f(s,0) = (f(s), 1). Similarly, a 1 — 1 correspondence 
g between T and a subset of S' gives a 1 — 1 correspondence g between 
T, and a subset of So defined by 9(t,1) = (g(t),0). Now, we apply 
the Schréder—Bernstein theorem to the disjoint case (which we have 
just proved) to obtain a 1 — 1 correspondence h: Sp + T;. But then, 
h=j thi: S > T is a 1 — 1 correspondence. 


Example 1.3.16. Let S = IN and T = Q,. In Example 1.3.14, 
we constructed a 1 — 1 correspondence f from S to a subset of 
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T and a 1 — 1 correspondence g from T' to a subset of S. Now, 
by the Schréder—Bernstein theorem, we can conclude that S$ and T 
have the same cardinality, i.e., that # (Q,) = # (IN) (and so Q, is 
countable). ?) 


Example 1.3.17. Since the proof of the Schréder—Bernstein the- 
orem was so tricky, it is worthwhile to follow the proof in con- 
structing a 1 — 1 correspondence from IN to Q,. It turns out to be 
more illuminating to proceed differently than in Examples 1.3.14 and 
1.3.16. 


We let IN = {positive integers} = {1,2,3,... } and Q; = {positive 
rational numbers} = {p/q | p,q positive integers, fraction in lowest 
terms}. In particular, Q; has the elements 1/1, 2/1, 3/1, ..., and we 
do not identify these with 1, 2, 3, ..., so that IN and Q, are disjoint. 

We let f: IN > Q4 be the function f(n) = n/1, so that f is a 1 — 
1 correspondence between IN and a subset of Q,, and we let g: Q4 > 
IN be the function g(p/q) = 2?37, so that g is a 1 — 1 correspondence 
between Q, and a subset of IN. 

We look at the strings in the proof of the Schréder—Bernstein 
theorem. For convenience, we label the string with sy) = n by S,. We 
have the strings 


Sti Sais 2-3 =6 >] 61S 2 3 = 199 Ss, 
So 1249/1 34-319 9 10/1 3 9 33 = 1908s 4 s~.; 


S;:5—4 5/1 4 32:3= 96 | 96/1 = 2.3 =... 


and note that these strings do not extend to the left. 
Then, we start the next string as 


636/142 3 =199 3 0/140" 3, 


but now we observe this string does extend to the left and we obtain 
the string 


Geil ijl 36S 6/13 2" 3 = i192 = 100/12 gS, 


so we see that Sg = Sj. 
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Then, 57,...,51, are all distinct, but we see Sjz2 = Sy. Then, 
S'13,..., 517 are all distinct. 
Then, we start the next string as 


18 5 18/14" <3 = 7e6432 — 786190 /1 4 og ee, 


but now we observe that this string does extend to the left and we 
obtain the string 


Sig: 1/2 + 18 > 18/1 > 2!8 .3 = 786432 — 786432/1 


_, 9786432 3 


Note that S;,...,517 are all type-III strings, while Sig is a type- 
IV string. We continue in this fashion to obtain type-III strings 
Si9,---,535 (with So4 = $4) and the next type-IV string is 


Bap 2 2/9 36 — 56/1 2°" 3 See, 


Given these strings, we can now define the 1 — 1 correspondence 
h: IN + Q,. We give a table of values of h(n) for selected values of n. 


nn) 


a aT 


ae! ae 


ee ee 
192 192/1 

Sas ee 
12288 | 12288/1 

= aaa 
786432 | 18/1 


1.4 Naive and not-so-naive set theory 


In our development of set theory, one of the ways we have obtained 
sets is by specifying a property that defines its elements. That is, 
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if P(x) is a proposition (i.e., a true-false statement) involving the 
variable x, then we have the set A = {x | P(x) is true}. For example, 


A = {integers zx | x is divisible by 2} 
defines the set of even integers, A = {..., —4, —2, 0, 2, 4,....}, 
B = {integers x | x = y? for some integer y} 


defines the set of perfect squares B = {0, 1, 4, 9, 16, ...}, and 
if p(z) = anz" + an_12""! +--+ 4,2 + a9 is a polynomial with 
complex coefficients, C = {complex numbers z | p(z) = 0} is the set 
C = {roots of p(z)}, even if we cannot find these roots. We will be 
using this construction throughout this book, and indeed it is used 
throughout mathematics. 

Proceeding in this way is sometimes called naive set theory 
because of the (naive) belief that this always makes sense. Unfor- 
tunately, it does not. 


Example 1.4.1 (Russell’s paradox). Is the class of all classes 
that are not members of themselves a member of itself? If the answer 
is yes, i.e., if it is a member of itself, then, since this class consists of 
all classes that are not members of themselves, the answer is no, it is 
not a member of itself. If the answer is no, i.e., it is not a member of 
itself, then, since this class consists of all classes that are not mem- 
bers of themselves, the answer is yes, it is a member of itself. Thus, 
either answer to this question leads to a contradiction. © 


Thinking about the example, we see that the problem was caused 
by considering the proposition “2 ¢ 2”. So, we cannot allow 
all propositions. 

Having shown you that there is a problem, I am not going to 
show you what the solution is. That would take us very far afield. 
I am simply going to tell you that there is a solution, which you 
may look up if you wish. (Most) mathematicians operate under ZF 
set theory, where ZF stands for Zermelo—Fraenkel. ZF set theory 
ensures us that {a | P(x)} makes sense for all the propositions P(x) 
we normally encounter in doing mathematics. 

ZF is a set of axioms for set theory. There is an additional axiom. 
In the statement of this axiom, J is an arbitrary indexing set. 
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Axiom of choice: Let {X;}ie7 be pairwise disjoint nonempty sets. 
Then, there is a set A = {A;}ic7 with a; € X; for eachi € TI. 


This axiom gets its name because we think of obtaining A by choos- 
ing one element from each set. If the indexing set J is finite, there 
is no problem, as we simply choose the elements one at a time. But 
mathematics is a finite process, so we cannot do that if the indexing 
set is infinite, so we need an axiom to guarantee the existence of 
A. It turns out that this axiom is independent of the axioms of ZF. 
(Most) mathematicians accept this axiom and operate under ZFC, 
which is ZF plus the axiom of choice. (The axiom of choice has many 
equivalent formulations. The most useful in constructing mathemat- 
ical proofs is Zorn’s lemma, which I am not going to state here but 
which you may look up if you wish.) 

Finally, a few words about cardinalities of sets. Recall that we 
showed #(IN) < #(JR). We have the famous 


Continuum hypothesis: There is no set S with # (IN) < # 
(S) < # (R). 


It is known that the continuum hypothesis is independent of 
the axioms of ZFC. 


1.5 Exercises 


1. Prove Lemma 1.1.18: 
(a) AUAS =U, ANAS =¢. 
(b) AUB=ASBCA,ANB=ASACB. 
2. Prove the distributive laws (Lemma 1.1.19): 
(a) (AUB)NC=(ANC)U(BNC). 
(b) (AN B)UC=(AUC)N (BUC). 
3. Prove De Morgan’s laws (Lemma 1.1.20): 
(a) (AU B)® = ASN BS. 
(b) (AN B)f = ASUBS. 
4, Show that AUB=AI(B-A)=BU(A-B) 
= (A- B)U(B-A)U(ANB). 
5. The symmetric difference of two sets A and B is 
AxB = (A- B)U(B-A). 


26 


10. 


An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


Show that: 

(a) AxB = (AN B°)U (ASN B). 
(b) (AxB)* = (AN B)U (ASN B?). 
(c) (AxB)*C = Ax(BxC). 


. Let A and B be subsets of X, and let C and D be subsets of Y. 


Show that: 
(a) (ANB) x (COND) =(AxC)N(Bx D). 
(b) (AUB) x (CUD) = (Ax C)U(Ax D)U(Bx C)U(Bx D). 


. Let f : X + Y bea mapping. Let A and B be subsets of X, 


and let C and D be subsets of Y. In each case, prove or find a 
counterexample: 


(a) f(AU B) = f(A) U f(B). 

(b) f(AN B) = f(A) f(B). 

(c) f-(f(A)) = 

(a!) f--(CUD) = f-(C)U f(D). 
(b') f-\(C ND) = f-(C)N f-*(D). 


(c') F(F-(C)) =C. 


. Let ~ zg be a relation on Z. For each of the eight possibilities, 


~ R reflexive: yes/no, 

~ Rr symmetric: yes/no, 

~ p transitive: yes/no. 

Find a relation ~ zg with these properties, except in the case 
yes/yes/yes, find a relation ~ g other than equality. 


. (a) Let A and B be finite sets. Show that 


#(AUB) = #(A) + #(B)— # (AN B). 
(b) Let A, B, and C be finite sets. Show that 

#(AUBUC) = #(A)+#(B) + #(C) — #(ANB)- #H(ANC)— 
#(BNC)+#(AN BNC). 

(c) This generalizes to any numbers of sets: Let Aj,...,An be 
finite sets. Show that 

#(A1 U...U An) = 0ig(-1) #1 ¥( As), 


when the sum is over nonempty subsets S of {1,...,n}, and if 
S = {i1,...%,} is such a set, Ag = Ay N---N Ajg. 

This is known as the inclusion—exclusion principle. 

(a) Let A and B be finite sets. Show that 


#(A x B)= # (A) # (B). 
(b) Let Ay,...A,, be finite sets. Show that 
#:(Ay x... X An) = #(A1)--- #(An). 


11. 
12. 


13. 
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If A is a finite set, # (A) =n. Show that # (P(A)) = 2”. 
Prove Lemma 1.3.4: Having the same cardinality is an equiva- 
lence relation on sets. 
In each case, we have al—1map f:A— B. Findal-1 map 
g : B — A. (Then, by the Schréder—Bernstein theorem, these 
two sets here the same cardinality.) 
(a) A=N, B = {finite subsets of N}, f(n) = {nb}. 
) = a, 
(¢) AjS(eeR | Veal, BH, fie) Hx 
) A=R, B = {finite subsets of R}, f(x) = {z}. 
) 


{z,v+1,x4+2,...}. 
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Chapter 2 


Group Theory 


Sets are sets, period. In other words, they have no additional struc- 
ture. Groups are sets with additional structure, that of an operation 
(satisfying certain properties). Logically speaking, these are what we 
should consider next. Afterwards, we will consider rings and fields, 
which are sets with two operations (again satisfying certain proper- 
ties). But groups are very important in their own right — indeed, 
you have certainly already seen them, even if you have not seen them 
by name — and there is a lot to say about them. 

We will consider groups in general. But we will pay particular 
attention to finite groups, where we can say a lot about their struc- 
ture. However, we will not only completely determine the structure of 
finite abelian groups, but also of infinite, though finitely generated, 
abelian groups. 


2.1 Definition, examples, and basic properties 


We begin by defining groups. 


Definition 2.1.1. A group (G, -) is a set of G with an operation 
(that is, a function) - on G x G satisfying the following properties: 


0) (Closure) If a,b€ G, a-bEG. 

1) (Associativity) If a,b,c € G, (a-b)-c=a-(b-c). 

2) (Identity) There is an identity element e € G, i.e., an element e 
such that e-a =a-e=a for every ae G. 


( 
( 
( 
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(3) (Inverse) Every a € G has an inverse element a7! € G, i.e., an 
element a~! such that a-a~' =a7!-a=e foreveryacG. 9 


Definition 2.1.2. An abelian group (G,-) is a group (G,-) with the 
additional property: 


(4) (Commutativity) If a,b € G,a-b=b-a. .) 


As a matter of language, when the operation is understood, we 
will refer to the group G rather than to the group (G,-). 

As a matter of notation, when the operation is understood, we 
write ab instead of a- b. In dealing with abelian groups, we will 
sometimes write ab and sometimes write a + b. 

Here is the very first invariant of a group. 


Definition 2.1.3. Let G be a group. The order of G is |G| = #(G). 
G is a finite or infinite group as |G| is finite or infinite. % 


Now let us see a bunch of examples. 
Example 2.1.4. 


(a) We have the trivial group G = {e}, consisting of the single ele- 
ment e (so |G| = 1) with the operation e- e = e. We also have 
the following trivial groups: ({1},-) and ({0},+), where - and + 
are the usual multiplication and addition respectively. 

(b) Let Z denote the integers, Q the rational numbers, R the real 
numbers, and C the complex numbers. Then if R = Z,Q,R, 
or C, (R,+) is an abelian group. If R = Z, let R* = {+1}. If 
R= Q,R, or C, let R* = R— {0}. Then (R*,-) is an abelian 
group. 

(c) Let S = {z € C| |z| = 1}. S is the unit circle in the complex 
plane. Then S is an abelian group under multiplication. 

(d) Let n be a positive integer and let W, = {exp(27ik)/n|k = 
0,...,2—1}. Then W,, is the set of complex n-th roots of 1, and 
W,, is an abelian group under multiplication. Note W,, is a group 
of order n. 

(e) Let R be as in part (b) and let My »(R) = {m-by-n matrices 
with elements in R}. Then M,,(R) is an abelian group under 
matrix addition. 

(f) Let R be as above and let GL,,(R) = {invertible n-by-n matrices 
with elements in R}. Then GL,,(R) is a group under multipli- 
cation and is not abelian for n > 2. (There is one subtlety we 
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should point out here: Invertible means that the inverse is also in 
GL,,(R), i.e., has elements in R. For example, the 1-by-1 matrix 
[2] is in GL,(R) but not in GL,(Z).) 0) 


Here are some basic properties of groups. 
Lemma 2.1.5. Let G be a group. 


(a) The identity element of G is unique. 
(b) For any a € G, its inverse a~' is unique. 


Proof. 


(a) Suppose e and f are two identity elements of G. Then by the 
properties of the identity, e= ef = f. 

(b) Suppose b and c are two inverses of a. Then b = be = b(ac) = 
(taje=ee= 6 


Lemma 2.1.6. Let a be an element of a group G. If ab = e, or 
ba =e, thenb=a7!. 


Proof. We do the first case. We have b = eb = (a ta)b = 


a (ab) =e 'e=a +. 


Remark 2.1.7. Associativity says that if we have any three elements 
a,b, and c of G, then (ab)c = a(bc). In other words, we may regroup 
parentheses as we wish without changing the value of the product. 
This says we may eliminate the parentheses entirely and write the 
product simply as abc. The same is true no matter how many ele- 
ments we have. For example, ((ab)c)(de) = a(((bc)d)e), and we may 
write this common product as abcde. This is quite tedious to prove in 
general — in fact, it is even tedious to state it precisely — so rather 
than a formal statement and proof, we will content ourselves with 
this remark. © 


Lemma 2.1.8. Let G be a group. 


(a) 2 =< 
(b) For any element a of G, (a~!)"! =a. 
(c) For any two elements a and b of G, (ab)~' =b-ta7}. 


Proof. 


(a) ee=e. 
(b) (a a= a(a") =e. 
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(c) We simply compute 
(ab)(b-*a-") = a(bb)a! = cea = aa =e 


and 


(b-'a~*)(ab) = b-'(a~'a)b = b 'eb = bb =. 


We now construct two important examples of groups. These can 
be done as special cases of more general constructions later, but we 
construct them “by hand” now so that we will have them available 
as we proceed. 

Example 2.1.9. We fix a positive integer n. 

We define a relation on the integers Z by x = y (mod n) if x—y 
is divisible by n (i.e., x — y = nq for some integer q). 

It is straightforward to check that = (mod n) is an equivalence 
relation. It then follows from the division algorithm (see Appendix 
A) that there are n equivalence classes, and that {0,1,...,n —1} is 
a complete set of representatives of the equivalence classes. We shall 
denote the equivalence class of i by [?], (for now). We let Z,, be the 
set of equivalence classes, Zp, = {[0]n, [I]n,..-, [2 — 1]n}. We wish to 
given Z, a group structure, and we define the group operation by 
the equation 


[in + Ln = [2 + Jn. 


A priori, this is not a valid definition. We defined the sum of two 
equivalence classes by picking a representative from each equivalence 
class and taking the equivalence class of the sum of these two repre- 
sentatives. But of course representatives are not unique. What would 
happen if we were to choose different representatives? In order for 
this definition to be valid we have to show that we would get the 
same answer, i.e., that this definition is independent of the choice of 
representatives. 

To do this, suppose [i’Jn = [i]n and [7’Jn = [J]n- 

Then i! = i+ nq, for some integer qi, and j’ = j + nq2 for some 
integer gg. But then 7’ + 7’ = (t+ 7) +nq3 where g3 = qi + qa, so [i’ + 
j'\n = [t+ J]|n- In other words, we have just shown that if [i’]n = [i]n 
and [7’Jn = [j]n, then [i’+7’Jn = [i+Jj]n, and so our group operation is 
well-defined. The rest of the properties of a group are easy to check, 
and then we find that Z,, is an abelian group of order n. © 
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Example 2.1.10. Again we fix a positive integer n and con- 
sider the equivalence relation « = y (mod n) on Z. We see 
(referring to Appendix A again) that if x = y (mod n), then 
gcd(x,n) = ged(y,n). We let Z* = {equivalence classes [7], with i 
and n relatively prime}, which makes sense (i.e., does not 
depend on the choice of representative). Then {0 < k < 
n|k and n are relatively prime} is a complete set of coset representa- 
tives of those equivalence classes in Z* (not of all equivalence classes) 
and so Z* = {[k], |,0 <k<n-—J1 and gced(k,n) = 1}. 
We give Z* a group structure by defining 


[in li]n = [t]n- 


Again we must see that the operation is well-defined, i.e., does 
not depend on the choice of representatives. So suppose [2’]n = [i)n 
and [7Jn = [J]n, so that 7’ = i+ nq, and j’ = 7 + nq for integers qi 
and q2. Then i’j’ = i7 + ng3 where q3 = iq, + Jjq2 + nqiq.- 

Now the group operations are more interesting to check. We have 
that gcd(k,n) = 1 if and only if there are integers x and y with 
ka +ny = 1. 

Closure: Let i and n be relatively prime, so ig + nr = 1 for some 
q,r. Let 7 and n be relatively prime, so js + nt = 1 for some s,t. 
Then (ig+nr)(jst+nt) =1-1=1 and (iqg+nr)(js+nt) = (ij)u+nv 
where u = qs and v = iqt+jrs+nrt, so ij and n are relatively prime. 
Thus if [i], € Z* and [j|n € Zr, lij]n € Zr. 

Associativity: Easy to check. 

Identity: [1], is the identity element of Z*. 

Inverses: If [k], € Z*, so that k and n are relatively prime, then 
there are integers x and y with kx + ny = 1, ie., kx = 1+n/(—y), so 
ka = 1 (mod n), and hence [k],[z]n = [1]n, and so [z], = [k];,?. 

Commutativity: Easy to check. 

Thus we see that Z is an abelian group of order 


p(n) = #({0 < k <n—1]|ged(k,n) = 1}). 


In particular, if n = p is a prime, we see that Z5 = {[l]p,---,[p — 
1],} has order p— 1. 


In learning to ride a bicycle, you use training wheels. Then, once 
you have learned, you remove them. We are just learning group the- 
ory, and our notation [k],, is the analog of training wheels. We shall 
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almost always use this notation here, but your should be aware that 
experienced mathematicians simply write k in this situation. Thus we 
have the equations [6]9 + [7]9 = [4]o and [5}11[8]a1 = [7]11, but math- 
ematicians will often simply say 6+ 7 = 4 in Zo and 5-8 = 7 in Zyj. 


Example 2.1.11. Given a group (G,-), we may write down the mul- 
tiplication table for G, where the entry in row a and column 0 is the 
value a- b. 

We then have, with the abbreviated notation just described, the 


multiplication table for Zs: and for Zs: 
1 2 3 4 
1/1 2 3 4 
21/2 4 1 8 
3/3 1 4 2 0 
4)};4 3 2 1 


One important way that groups often arise is as Aut(X), the 
automorphism group of a structure X. That is, we have a “structure” 
X, and we consider the invertible functions from X to itself. We 
have put the word structure in quotes, and we do not have a precise 
definition of it, but we will see some examples of it here. 

We begin with an example that you have probably seen (at least 
in disguise). 


Example 2.1.12. Let V be a vector space over F, where F is as in 
Example 2.1.4. Then G'L(V) is the group of all invertible linear trans- 
formations T: V — V. Linear transformations are those functions 
that preserve the vector space structure. If V is finite-dimensional, 
say dim(V) = n, and we choose a basis B of V, then we can identify 
T with its matrix [T]g in the basis B, and in that way we can identify 
GL(V) with GL,,(F), as in Example 2.1.4. (For purposes of future 
comparison, we recall that if S: V ~ V and T: V —> V are linear 
transformations, then ST: V — V is the linear transformation given 
by ST(v) = S(T(v)). We observe several things: Multiplication in the 
group is composition of functions. In the composition ST, we apply 
the function on the right (that is, T) first. Associativity in the group 
follows from the fundamental fact that composition of functions is 
associative. © 
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Example 2.1.13. Let X be a set (finite or infinite). (Thus X has no 
additional structure beyond being a collection of its elements.) Then 
Aut(X) consists of the invertible functions 0: X — X. An invertible 
function is just a bijection, so we are just rearranging the elements of 
X. Such a function is called a permutation of the elements of X. The 
group Aut(X) is called the symmetric group on the elements of X. 

For example, let X = {1,2,3,4,5}. Then we have o € Aut(X) 
when o is the function o(1) = 1,0(2) = 3,0(3) = 5,0(4) = 4,0(5) = 
2, and we have 7 € Aut(X) where 7 is the function 7(1) = 2,7(2) = 
4,7(3) = 3,7(4) = 5,7(5) = 1. Then p = o7 € Aut(X) is the 
function 


p(1) = or(1) = o(r(1)) = o(2) =3 
p(2) = o7(2) = o(7(2)) = o(4) =4 
p(3) = o7(3) = o(7(3)) = o(8) =5 
p(4) = a7(4) = o(7(4)) = 0(5) = 2 
p(5) = o7(5) = o(r(5)) =o) = 1. 


You can check that if X = {1,2,...,n}, then Aut(X) is a group 
of order n!. In this case Aut(X) is called the symmetric group on n 
elements, and is denoted S,. This is a very important group. While 
we will say no more about it now, we will be considering it in detail 
later (see Section 2.10). 0) 


Example 2.1.14. Let n > 3 and let X be a regular n-gon. For the 
sake of definitiveness we let X have vertices the complex n-th roots 
of 1, exp(2aik/n) for k = 0,...,2—1. We label each vertex by the 
value of k, so that the complex number 1 is labeled as vertex 0 and 
the numbering increases as we go counterclockwise. 

We let G = Aut(X), the group of symmetries of X. Let us first see 
how many elements G has. To do this, note that any symmetry of X 
must take a vertex to a vertex. Thus if we begin with the vertex 0, 
it can go to any vertex 0, 1,2,...,2—1, so there are n possibilities. 
Once we choose that 0 goes to some vertex k, we can ask where does 
1 go to. The vertex 1 is adjacent to 0, so its image must be adjacent 
to k. We can choose either of the two possibilities: k+1 or k—1 (mod 
n). So there are 2n possibilities. But note that once the images of 
O and 1 are fixed, the images of every vertex are determined. Thus, 
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|G| = 2n. This group is denoted D2, and is called the dihedral group 
of order 2n. 

Let us examine this group more closely. Of course we have the 
identity map on X (i.e., the map which leaves X pointwise fixed), 
and this is the identity element of D2,, which we denote by e. Next, 
we have the element a@ of D2, which rotates X 27/n degrees counter- 
clockwise, so that a(0) = 1,a(1) = 2,...,a(n—2) =n—-1,a(n—-1) = 
0. And next, we have the element (6 of Dg, which reflects X across 
the x-axis, so that 6(0) = 0,6(1) = n—-—1,8(n —1) = 1,6(2) = 
n — 2,B(n — 2) = 2,.... Note a” = e and 6? = e. Then as a set 
Don has 2n elements {a’8J|0 <i<n—1, 0 <j < 1}. Note that 
(a’)-! = a" and 67! = 8B. Of course, by a8 we mean the sym- 
metry of X obtained by first applying 6 and then applying a. We 
compute 


We observe that Ga 4 af and so D2, is not abelian. 
We experiment a little further and find 
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Ba (2) = B(a*(2)) = Bl) =n-1 


Ba-'(n — 1) = B(a7“"(n — 1) = B(n— 2) =2 


and thus we see that Ba~! = a8. Then Ba~? = (Ba!)a“t = 
(aB)a~! = a(Ba-!) = a(aB) = a8 and in general we see that 
Ba~* = a8. Furthermore, for any i, (a’8)(a’B) = (a’B)(Ba~") = 
a'(B?)a* = at(eja* = ata *t = e, and more generally 
(a'B)(aB) = (a°8)(Ba~?) = a'(B?)a7F = ai(e)ad = a’ 4, This 
enables us to write down the multiplication table for D2,. We do so 
for n = 3. 


* 8B  oB af 


€ a a 
e€ € a Qa B ab arp 
Q Qa a? e€ aB a’?B B 

a? a? € a ab B aB 
2 


B B of8 o8B e a@ a 
aB | af B ad@B a € a? 
a 


a°Bla’B ab 8B : 


Q 


We leave you to check that, geometrically, f{e,a,...,a"~1} are all 
rotations and {3,a8,...,a"~!} are all reflections. (Warning: Our 
notation D2, is the most common one, but some authors denote this 
group by Dy.) o 

If we let i = /—1, then it is easy to check that G = {1,7, —1, —7} is 
a group under multiplication. Here is a related, and more interesting, 
eroup. 


Example 2.1.15. The quaternion group Qs is the group 


On={ I, a, ve k} 


with 7? a k? —1, ij = k, and ji = —k. These last 
two relations have the consequences that jk = 1, kj = —i and 
ki = 9, ik = —j, as you may check. Here is the multiplication table 


for Qs. 
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i #« =< =~ 4 =) & =k 


(By the way, there is no standard way to order the elements of a 
group, so multiplication tables for the same group may look different. 
However, although it is not logically necessary to do so, everybody 
puts the identity element first.) 

Here is a way to get new groups from old. 


Definition 2.1.16. Let G and H be groups. Their product G x H 
is the group 


Gx H={(g,h)|g€G,he A} 


with product defined by (91, /1) - (92, he) = (9192, hih2). 0 


We leave it to you to check that G x H is a group. 
Finally, we record a construction for future use. 


Definition 2.1.17. Let G be a group and let A and B be subsets of 
G. Then AB is the subset of G given by AB = {ab|a € A,b € B}. If 
A = {a} consists of a single element, we write aB instead of {a}B, 
and similarly if B consists of a single element. 0) 


For arbitrary subsets A and B, this construction is not very useful, 
but we will see particular cases in which it is not only useful, but 
indeed essential. 

We record one property of this construction. 


Lemma 2.1.18. Let G be a group and let A, B, and C be subsets of 
G. Then (AB)C = A(BC). 


Proof. This follows directly from the associative law (ab)c = a(bc) 
for multiplication of individual elements of G. 
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2.2 Homomorphisms and isomorphisms 


We now want to consider mappings between groups. What should a 
mapping y: G > H from a group G to a group H be? On the one 
hand, a group is a collection of elements, so y should be a function 
from the set of elements of G to the set of elements of H. But on 
the other hand, a group is more than just the set of its elements. A 
group has an operation as well. Thus to be a mapping of groups, y 
has to “respect” the operations on G and H. This leads us to the 
following definition. 


Definition 2.2.1. A homomorphism y: G — H from a group G to 
a group #H is a function y: G + H with the property that y(ab) = 
y(a)yp(b) for every a,b € G. 


Here are two simple, but basic, properties of homomorphisms. 
Lemma 2.2.2. Let y: G— H be a homomorphism. Then: 


(i) ple) =e; and 
(ii) For anya é G, y(a) = y(a)*. 


Proof. 


Example 2.2.3. 


(a) For any group G, id : G > G is a homomorphism. 

(b) For any groups G and H, y: G > H defined by ¢y(g) = e for 
every g € G is a homomorphism. 

(c) Let G and H be any groups. Then we have homomorphisms 
47:Go4Gx Hig: H > Gx H,m:Gx HA > G, and 
72: Gx H — H defined by: 


i1(g) 
i2(h) 
(9, h) 

h) 


TY 
2 (g, 


(g,e) forevery gEG 

(e,h) forevery he H 

g forevery (g,h)€GxH 

h forevery (g,h)€GxH i. 
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Our next set of examples will be of homomorphisms between 
abelian groups, and in these examples we will write the groups 
additively. 


Example 2.2.4. 


(a) For any fixed integer n, we have a homomorphism vy: Z > Z 
defined by y(z) = ni. We observe that y is a homomorphism as 
gi+j) =nitj) =nitnj = pli) + (9). 

(b) For any fixed integer n, we have a homomorphism vy: Z > Z,y, 
defined by y(2) = [t]n. We observe that y is a homomorphism as 
git jf) = [i+ dn = [ln + Lin = Y@) + (9) by the definition of 
addition in Z,. 

(c) For any fixed integers m and n, we have y: Z, — Z,, defined by 
P([iIn) = [mi]n. Again v([i]n + [i]n) = (lt + Jn) = mi t+ jn = 
[m(i + Dn = [mit+ mj|n = [mi], + [Mj]n = mtn + mln = 
oli) + (3). 

(d) For any fixed integers m and n with n a multiple of m, we 
have vy: Zn — Zy, defined by ¢y([i]n) = [i]m. Here we need to 
be careful to ensure that y is well-defined. The problem is that 
we have defined y({i],,) by choosing a representative i of this 
equivalence class, and we need to know that this value does not 
depend on our choice. Thus consider 7 with [i], = [i]n. Then 
~([t’|n) = [’]m and so we need to show that [i’]m = [i]m. Now 
[t’]m = [t]m is the equation 7’ = i (mod m), or equivalently, that 
i! — i is divisible by m. However, we chose i’ with [#’]n = [i]n, 
which similarly means that 7’ — 7 is divisible by n. But, since we 
are requiring that n is a multiple of m, if i! —7 is divisible by n 
then certainly i! — 7 is divisible by m, which is what we needed 
to show. © 


Definition 2.2.5. Let G and AH be groups. A homomorphism 
y: G > His an isomorphism if there is an inverse homomorphism 
y !: H -G. An isomorphism y: G > G is called an automorphism 
of G. If there is an isomorphism y: G — H, the groups G and H are 
isomorphic. 0 


Here is a criterion for a homomorphism to be an isomorphism. 


Lemma 2.2.6. Let y: G — H be a homomorphism. Then yp is an 
isomorphism if and only if yp is 1-1 and onto (1.e., if and only if ~ is 
a bijection). 
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Proof. If y: G — ZH is an isomorphism, by definition it has an 
inverse y-!: H — G. But ¢ is a function, and to be invertible as a 
function it must be 1-1 and onto. 

Conversely, suppose y: G — H is 1-1 and onto. Then vy has an 
inverse function y~! : H > G. We must show that y~! is a homo- 
morphism. To this end, let x,y € H and let a= y '(x),b = yp" (y). 
Then y(a) = x and ¢(b) = y. Since y is a homomorphism, y(ab) = 
9(a)p(b) = ry, so y"'(zy) = ab = yt (x) yp" (y), as required. 


Remark 2.2.7. This lemma says that if y is invertible as a map of 
sets, it is automatically invertible as a map of groups. But it is not 
always the case that if y satisfies an extra condition then y~! does. 
Here is an example. Let X = {x €R|0<2<1 oraz = 2} and let 
Y={rc£eER|O0<a2< 1}. Define f: X >Y by f(x) =z if0<ar<1 
and f(2) = 1. Then f: X — Y is a continuous function, which is 
1-1 and onto, so has an inverse f~!: Y > X. in fact, f—! is given 
by f-l(az) = 2 if0 < 2 <1 and f—1(1) = 2. But note that f—1 is 
not continuous. v) 
Here is what, in some sense, is the simplest kind of group. 


Definition 2.2.8. A group G is cyclic if it consists of the powers 
(positive and negative) of some element a of G. Such an element is 
called a generator of G. © 


Theorem 2.2.9. 


(a) An infinite cyclic group G is isomorphic to Z. 
(b) A finite cyclic group G is isomorphic to Zp, where n is the order 
of G. 


Proof. Let G be cyclic with generator a. Suppose that there is no 
nonzero value of k for which a* = e. Then we have an isomorphism 
yp: Z > G given by y(n) = a”. (y is a homomorphism by the laws 
of exponents. y is onto by the definition of a cyclic group. ¢ is 1-1 
as. if my - a, Gln) = o™ + olig) = o™, 28 1 a™@ —a™ then 
a™—"2 = e,) This proves part (a). 

On the other hand, suppose a” = e for some nonzero value of 
k. Since a~* = (a*)~!, then a* = e for some positive value of k. 
Let k = n be the smallest such positive value. We first observe that 
a* = e if and only if k is a multiple of n. For we may write k = nq+r 
where r is an integer with O<r<n. 


k 


42 An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


Then a* = a™@+" = (a”)%a" = ea" = a’. If r = 0 then certainly 
a® =e. But ifr £0 then a” # e, as r is a positive integer less than n 
and by definition n is the smallest positive integer with a” = e. This 


same computation shows that as a set, 
G={a |0 sre a)=fea..50""} 


and G has order n. We claim we have an isomorphism y: Z, > G 
given by y([k]n) = a”. The first thing we need to see in that this 
definition makes sense, as [k], is an equivalence class. So suppose 
[kln = [k’]n. Then k! = k + ng for some q and then a*’ = a*t™4 — 
a’a’? = ake = a*, and so the definition of y does not depend on 
our choice of representatives. But given that, y is an isomorphism. 
(Again y is a homomorphism by the laws of exponents, y is onto 
by the definition of a cyclic group, and ¢ is 1-1 as if [ki]n 4 [koln, 
a 4 a’ as ky — kg is not divisible by n.) This proves part (b). 


Remark 2.2.10. You will notice that in the proof of part (b), we 
had an extra step where we had to show that y is well-defined. This 
will follow as part of a general result we will prove later, but we 
decided to get started by proving this particular case “by hand”. 


Suppose we want to check that y is onto. In general, there is no 
shortcut. But suppose we want to check that y is 1-1. Here we have 
a shortcut. 


Lemma 2.2.11. Let y: G— H be a homomorphism. Then vp is 1-1 
if a#e implies y(a) # e. 


Proof. To say that y is 1-1 is to say that for any two elements a 
and b of G with a 4 b, y(a) # y(b), so this is just a special case. 
On the other hand, suppose that y(a) 4 e for any a # e and 
consider two elements a 4 b of G. Then ab~! 4 e, so y(ab-!) Fe. 
But y(ab-t) = y(a)y(b)“! so y(a)y(b)-! ¥ e and hence y(a) # 
(0). 


Example 2.2.12. 


(a) Let y: Z > Z be multiplication by n, y(t) = ni. Then is neither 
1-1 nor onto if n = 0, y is both 1-1 and onto if n = +1, and y is 
1-1 but not onto if |n| > 1. 

(b) Let n be a positive integer and let y: Z > Z,, by y(t) = [in. 
Then gy is onto but not 1-1. 
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(c) Let m and n be fixed positive integers and let y: Z > Z, be 
Y([iln) = [mi]n. Let d = gcd(m,n) be the greatest common divi- 
sion of m and n. If d > 1, then note that [n/d]jn 4 [0], but 
g([n/d|n) = [mn/d|n = [(m/d)n|n = (m/d)[n|n = (m/d)[0]n = 
[0], Thus, we see that y is not 1-1. 


Now y is a function from the finite set Z,, with n elements, to 
the finite set Z,, also with n elements. Thus, since y is not 1-1, we 
also have that y is not onto. 

On the other hand, suppose that m and n are relatively prime, 
ie., that d = 1. Then there are integers 7 and y with 1 = mz + ny. 
Thus for any integer k, k = mkx + nky so [k]n = [Mkaz]n = m[ka]n. 
In other words, [k], = y([k2]n) and so we see y is onto. Again we 
note that y is a function from the set Z,, with n elements, to the set 
Zn, with n elements, so this time, since y is onto, y is also 1-1. 9 


We have said that one way groups arise is as the automorphism 
groups of various kinds of “structures”. We may let that structure 
itself be a group, and so we are led to the following definition. 


Definition 2.2.13. Let G be a group. The automorphism group 
Aut(G) of G is the group of all automorphisms y: G > G (with 
the group operation being composition). © 


Lemma 2.2.14. Aut(Z,,) is isomorphic to Z*. 


Proof. Let y: Zn — Zp be a homomorphism. Then y([k]n) = 
ky([1]n), so vy is determined by y([1]n). Let y([1]n) = [Mn = m[L]n. 
Then y([k]n) = m[k]n. Thus ¢ is one of the homomorphisms consid- 
ered in Example 2.2.12, and we saw there that vy is an isomorphism 
exactly when m is relatively prime to n. Now for any m and m’ (rel- 
atively prime to n or not), if [mjn = [m’]n, and y([k]n) = m[k]n, 
y'([k]n) = m’'[kln, then y = ¢! (ie., y: Zy  Zy is determined 
by y((l]n)). Thus, we have a 1-1 onto map, i.e., an isomorphism 
of sets, ®: Aut(Z,) > Z* given by ®(y~) = ¢([L]n). To conclude 
that ® is an isomorphism of groups it remains to show that ® is a 
homomorphism. So let yi([1]n) = [mijn and y2([1n) = [me]n. Then 
P(y21) = v2yi([]n) = va(yi([ln)) = ¢2(lmijn) = me[miln = 
[mgm |n = [Me]n[M1]n = O(y2)@(y1) as required. 


We conclude this section with a particularly interesting, and use- 
ful, isomorphism, and its consequences. 
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Theorem 2.2.15. Let m and n be positive integers. Then Zmn is 
isomorphic to Zm X Zn if and only if m and n are relatively prime. 


Proof. The interesting case is when m and n are relatively prime, 
but let us get the other case out of the way first. 

Suppose m and n are not relatively prime. Then m and n have 
a least common multiple 1 < mn. let yg: Zmn — Zm xX Zn be any 
homomorphism. Then ¢([l]mn) = ([%]m, [y]n) for some x, y. But then 
mn #0 while p([]mn) = ([l2]m; [lyln) = ([0lm; [O]n) so v is not 1- 
1. Thus, we see there is no possible isomorphism between Z,, and 
Zim X Zn, i.€e., these two groups are not isomorphic. 

Now suppose m and n are relatively prime. By Example 2.2.4(d), 
we have homomorphisms y1 = Zmn — Zm given by ¢1([k]mn) = 
[K]m and yo: Zmn > Zn given by Yo([k]lmn) = [k]ln. Then we 
have a homomorphism y = (91,2): Zmn — Zm X Zn, i.e., 
e({khmn) = (1(klmn);$2((klmn)) = (lh]ms (kn): (In particular, 
p([1]mn) = ([L]m, [1]n).) We claim that y is an isomorphism. Sup- 
pose 9([k]mn) = ([O)m,[0]n). Then [k]m = [O}m and [A]n = [O)n, ie., 
k is divisible by m and & is divisible by n. Since m and n are rel- 
atively prime, that implies k is divisible by their product mn, i-e., 
[K]mn = [O]mn- Thus ¢ is 1-1. Since Z,, has mn elements, as does 
Zim X Zn, that implies that vy is onto as well, so y is an isomorphism, 
and these two groups are isomorphic. 


Corollary 2.2.16. Let mj,,...,mx be pairwise relatively prime pos- 
itive integers (i.e., mj and m; are relatively prime whenever i F j.) 
Let M=m,...mg. Then Zy is isomorphic to Zm, X-++X Zm,, with 
an isomorphism p given by y([1]az) = ([L]mz,--+ > [Lm,)- 


Proof. This follows from Theorem 2.2.15 by induction on k. 


Corollary 2.2.17 (Chinese remainder theorem). Let mj,..., 
mz be pairwise relatively prime positive integers and set M = 
my ,...M x. Let ay,...,a,% be arbitrary integers. Then there is an inte- 
ger x satisfying the congruences 


x = a,(mod m1) 


x = az(mod mz) 


and x is unique (mod M). 
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Proof. By Corollary 2.2.16, we have an isomorphism y: Zy — 


Zm, X +++ X Zm,- Since y is an isomorphism, it has an inverse go. 


Then zx is given by 


[z] a — go ((e4 ln tty [ak |my)- 


Remark 2.2.18. The Chinese Remainder Theorem has a generaliza- 
tion, which is best understood in the context of ring theory. But, as 
we have just seen, Corollary 2.2.17, the original case of this theorem, 
can be understood as a theorem in group theory/number theory. © 


2.3 Subgroups 


The basic idea of a subgroup is a very simple one, but subgroups 
play an important role. 

In this section we will usually write groups multiplicatively, as 
we will usually be considering general (i.e., not necessarily abelian) 
groups. 


Definition 2.3.1. Let G bea group. H is a subgroup of Gif H CG 
(i.e., H is a subset of G) and H is a group (with the same operation 
as in G). H is a proper subgroup of G if H CG. © 

First, we have a very simple criterion for a subset H of G to bea 
subgroup of G. 


Lemma 2.3.2. A subset H of G is a subgroup of G if and only if: 
(i) eed. 

(ii) Ifa,b€ H then abe H. 

(iii) fae H thena' eH. 


Proof. These are just the group axioms for H, except for associa- 
tivity, which is automatic, as it holds in G. 


Remark 2.3.3. If G is abelian then any subgroup H of G is 
abelian. O 


Remark 2.3.4. Every group G has the trivial subgroup {e}, and G 
itself is a subgroup of G. © 
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Example 2.3.5. 
(a) We have subgroups 
(Z, +) € (Q, +) € (R, +) € (C, +). 
(b) We have subgroups 
({+1},-) S (Q — {0}, -) S (R— {0}, -) C (C— {O},-). 9 


AS 


Example 2.3.6. 


(a) Let R = Z,Q,R, or C and let n be a positive integer. 
We have (among others) the following subgroups of GL,,(R), the 
group of invertible n-by-n matrices with entries in R. (We do not 
bother to repeat n and R, for simplicity.) 


{identity matrix I} 
C {invertible scalar matrices} 
C {invertible diagonal matrices} 


C {invertible upper triangular matrices} 


C GL, (R). 
(b) GL, (R) also has the subgroup 
SL,(R) = {matrices of determinant 1}. . 


Example 2.3.7. Let G, and G2 be groups, and let G = G, x Go. 
Then G, x {e} and {e} x G2 are both subgroups of G. .) 


Lemma 2.3.8. The distinct subgroups of Z are nZ = {ni|i € Z} 
for some nonnegative integer n. 


Proof. Let H be a subgroup of Z. If H = {0}, then H = OZ. 
Assume H 4 {0}. Then H contains a nonzero integer k. Since H is 
a subgroup, if k € H then —k € H, so we may assume k is positive. 
Thus, we see that H contains a positive integer k. Let n be the 
smallest positive integer in H. We claim H = nZ. 

Certainly nZ C H, by closure. To show H C nZ, let m € H. Then 
we may write m = nq+r for integers gq and r with 0 < r < n. If 
r #0, then r = m—nq = m+(—n)q € A with 1 <r < n, impossible 
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as n is the smallest positive integer in H. Hence r = 0, m = nq, and 
A CndZ. 

From our construction we see that these subgroups are all 
distinct. 


We now consider the situation of a group G and a subgroup H, 
and define a pair of (closely related, but in general not the same) 
equivalence relations on the elements of G, which will lead us to the 
important notions of left and right cosets of H in G. 


Definition 2.3.9. Let G be a group and let H be a subgroup of G. 
We define the equivalence relations x =; y (mod H) and x =, y (mod 
Hf) by: 


z= y (mod H) if «lyedH, 
zt=,py(mod H) if ay led. .) 


Remark 2.3.10. If G is abelian then 2~'y = y~'a so these two 
equivalence relations are the same, and we abbreviate them to x = y 
(mod #). % 


While in general distinct, these two equivalence relations are sim- 
ilar, so we will henceforth state results for both, but only prove them 
for x =; y (mod H). 


Lemma 2.3.11. The relations x = y (mod H) and x =, y (mod H) 
are equivalence relations. 


Proof. We must check the three properties of an equivalence 
relation. 

Reflexive: x =, x (mod H), as x~!x = e € H, since H isa 
subgroup. 

Symmetric: Suppose x =, y (mod H), so aly € H. Since H is 
a subgroup, (a—'y)~! € H. But («~'y)-! = y“!2, so yx € H and 
y =. « (mod H). 

Transitive: Suppose x =; y (mod H) and y = z (mod H). Then 
x 'y € H and yz € H. Since H is a subgroup (x~'y)(y~!z) € H. 
But (271y)(y~1z) = 271z, so a-!z € H and x = z (mod H). 


Definition 2.3.12. The equivalence classes of elements of G under 
the relation x =; y (mod #) are the left cosets of H in G, and the 
equivalence classes of elements of G under the relation x =, y (mod 
Hf) are the right cosets of H in G. 0) 
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Lemma 2.3.13. 


(a) Consider the relation x = y (mod H). Then 


(i) The equivalence class of e is H, i.e., the left coset of H 
containing e is H. 
(ii) More generally, the equivalence class of the element x of G 
is ©H, i.e., the left coset of H containing x is xH. 
(iii) Any two left cosets of H are either identical or disjoint. 
(iv) The left cosets of H partition G. 


(b) Consider the relation x =, y (mod H). Then 


(i) The equivalence class of e is H, 1.e., the right coset of H 
containing e is H. 
(ii) More generally, the equivalence class of the element x of G is 
Hx, 1.e., the right coset of H containing x is Hx. 
(iii) Any two right cosets of H are either identical or disjoint. 
(iv) The right cosets of H partition G. 


Proof. 


(i) The equivalence class of e under x = y (mod #) is {y € G|e = 
y (mod H)} = {ye Gle tye H}={yeGleye H}={ye 
Gly€ HA} =dH. 

(ii) First we observe that zH = H if and only if z € H. To see this, 
note that if z © H, then zh € A for any h € H,so zH C H, and 
for any h € H, h = z(z~'h) so H C zH, and so they are equal. 
On the other hand, if z ¢ H, then ze = z € zH so zH # H. 
Then = y (mod H) 8a lye Hea 'lyH =H & yH = 
xH (where the last = is Lemma 2.1.18) and, similarly, yH = 
cH eye crH. 

(iii) and (iv) are properties of equivalence classes in general (Corol- 
lary 1.2.24 and Theorem 1.2.26). 


Lemma 2.3.14. Let H be a subgroup of G. Then for any x € G. 
jen | =|H| ond |Ha =A). 


Proof. We have a 1-1 correspondence f: H — «H given by f(h) = 
th. 


Lemma 2.3.15. Let H be a subgroup of G and let {x;} be a complete 
set of left coset representatives of H inG. Then {a;*} is a complete 
set of right coset representatives of H in G. 
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Proof. We have a 1-1 correspondence f: G > G given by f(g) = 
g '. Then G = Uz;H so applying this correspondence, f(G) = 
Uf(c;H). But f(G) =G and 
f(sH) = {f(aih) |b € H} = {(ash)1 |b € HY 
= {hla |he H} = Haz, 


Definition 2.3.16. Let H be a subgroup of G. The index [G : H 
of H in G is the number of left (or right) cosets of H in G. o 


Theorem 2.3.17. Let G be a finite group and let H be a subgroup 
of G. Then |G| =|A|[G: A]. 


Proof. Let |G| = n,|H| = m, and |G: H] =k. Let {21,...,x%} 
be a complete set of left coset representatives of H in G. Then 


k k 


k 
G=|[ae so |ej/=S aa. n= > la, 
t=1 


i=1 i=1 


But |z;H| = |H| =m for every i, by Lemma 2.3.14. 


Corollary 2.3.18 (Lagrange’s theorem). Let G be a finite group 
and let H be a subgroup of G. Then |H| divides |G|. 


Proof. Immediate from Theorem 2.3.17. 


Definition 2.3.19. Let G be a group and let a € G. The order of 
a is m if m is the smallest positive integer such that a” = e. The 
element a has infinite order if no such m exists. © 


Corollary 2.3.20. Let G be a finite group and let a € G. Then the 
order of a divides the order of G. 


Proof. Let |G| = n. If a has order m, then H = {e,a,...,a’~+} 
is a subgroup of G, and so |H| = m divides |G| = n by Lagrange’s 
Theorem. 


Corollary 2.3.21. Let G be a group of prime order p. Then G is 
cyclic, generated by any element a # e of G. 
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Proof. By Corollary 2.3.20, the order of a must divide p. Since 
a # e, the order of a is not 1. Hence a has order p, and so H = 
fe.d..c: a “b= G, 


We now want to look carefully at the (potential) difference 
between left and right cosets of a subgroup H of a group G. They 
can only differ in case G is nonabelian, so we look at a nonabelian 
group G. 


Example 2.3.22. We have introduced the dihedral groups Do, in 
Example 2.1.14. 

We begin by considering G = Dg. Then, in the notation of that 
example, G = {e,a,a’, B,a8,a7G}. We let N = {e,a,a7}, Hp = 
{e, 8}, Hy = {e,aB}, Hy = {e,a78}, and we observe that N, Ho, 
A,, and Hg are all subgroups of G, of orders 3, 2, 2, 2 respectively. 

First let us consider N. Since N has order 3, and G has order 6, 
[G : N] = 2, so N has two left cosets, and two right cosets. A little 
computation shows: 

{left cosets of N} = {{e,a,a7}, {8,a8, 076}, 

{right cosets of N} = {{e,a,a7}, {8,a8,a7B}}. 

Thus in this case the left cosets of N and the right cosets of N 
agree. 

On the other hand, let us consider Hp. Since Ho has order 2, and 
G has order 6, [G : Ho] = 3, so Ho has three left cosets, and three 
right cosets. A little computation shows: 

{left cosets of Ho} = {{e, 8}, {a, a8}, {a7 a7 B}}, 

{right cosets of Ho} = {{e, B}, {a, a7 5}, {a, aB}}. 

Thus in this case the left cosets of Hp and the right cosets of Ho 
disagree. 

Similarly we compute: 

{left cosets of H,} = {{e, a8}, {a, a? 8}, fa?, Bh}, 

{right cosets of Hi} = {{e, a6}, {a, B}, {a7, a7 B}}, 
and 

{left cosets of Hz} = {{e, a?B}, fa, B}, {a?, aB}}, 


{right cosets of H2} = {{e, a7 8}, {a, aS}, {a7, B}}, 
so the left cosets of H; and the right cosets of Hy, and the left cosets 
of H2 and the right cosets of H2, also disagree. © 


Remark 2.3.23. Before going any further we want to emphasize 
that when we write a left coset of a subgroup H (for example) as 
aH, we are making a choice of the coset representative (i.e., the 
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representative of the equivalence class), and that we could equally 
will make a different choice and write the coset as a’ H. For example, 
referring to our computations in Example 2.3.22, the two left cosets 
of N (in the order written) are eN = aN = a?N, and BN =a8N = 
a?8N, while the two right cosets are Ne = Na = Na?, and NB = 
Na = Na?8. For Ho, the three left cosets are eH) = GH, aHp = 
aBHo, and a?Hp = a?8Hpo, while the three right cosets are Hoe = 
Hof, How = Hoa?B, and Hoa? = Hoa (and similarly for H, and 
Hz). In principle, all choices of coset representatives are equally good. 
(In practice, we always choose the representative of the left or right 
coset H of H to be the identity element e, but that is a choice, and 
we could equally well make a different one.) © 


We now want to look at subgroups whose left and right cosets 
agree (such as the subgroup N of Dg in Example 2.3.22.) While this 
may seem to be a curiosity, it turns out to be an extremely important 


property. 


Definition 2.3.24. Let G be a group, and let N be a subgroup of 
G. Then N is a normal subgroup of G, written N <G, if every left 
coset of N in G is a right coset, and vice-versa. © 


Lemma 2.3.25. Let G be a group and let N be a subgroup of G. 
The following are equivalent: 


(i) N is a normal subgroup of G. 


) 

) a 'Na=N for everyae G. 
) aNa-!=N for everya€ G. 
) a !NaCN for everyaeG. 
) aNa! CN for everyae G. 


Proof. First observe that (iii(a)) and (iii(b)) are equivalent, and 
that (iv(a)) and (iv(b)) are equivalent, since each is obtained from 
the other simply by replacing a by a7!. 

Suppose (i) is true. Then for any a € G, aN = Nb for some b. But 
note a= ae € aN and soa€ Nb. But a= ea € Na. Since two right 
cosets are either identical or disjoint, we must have Na = Nb and 
so aN = Na. On the other hand, if (ii) is true then (i) is certainly 
true, as (ii) is a more specific statement than (i). Thus (i) and (ii) 
are equivalent. 
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Now aN = Na & a-'(aN) = a-}(Na) & N = a'Na by 
Lemma 2.1.18, so (ii) and (iii(a)) are equivalent. 

Certainly (iii(a)) implies (iv(a)), as (iii(a)) is a stronger state- 
ment than (iv(a)). On the other hand, suppose (iv(a)) is true. Then, 
as we have observed, (iv(b)) is true. But also a7'Na C N & 
a(a~!Na)a“! C aNa“! & N C aNa7}, again by Lemma 2.1.18, 
so (iii(b)) is true, and hence so is (iii(a)). 


Lemma 2.3.26. Let G be an abelian group. Then every subgroup of 
G is normal. 


Proof. For every a € G, and every « € G, a-!za = x. 


Example 2.3.27. The converse of Lemma 2.3.26 is false. Let G = Qg 
be the quaternion group of order 8 as in Example 2.1.15. Then G is 
a nonabelian group, but it is straightforward to check that every 
subgroup of G is normal. © 


Now let us relate subgroups and homomorphisms. 


Definition 2.3.28. Let y: G— H be a group homomorphism. 
The image of » is Im(vy) = {h € H|h = y(g) for some g € G}. 
The kernel of y is Ker(y) = {g € G| y(g) = e}. .) 


Much of the importance of normal subgroups comes from part (b) 
of the following lemma. 


Lemma 2.3.29. Let y: G— H be a homomorphism. 


(a) Im(y) is a subgroup of H. 
(b) Ker(y) is a normal subgroup of G. 


Proof. 


(a) We have to verify the three properties of a subgroup. By Lemma 
2.2.2, p(e) =e so e € Im(y). If hi, ho € Im(y), then hy = y(g1) 
for some g; € G, and hz = y(gz) for some gz € G, so, since y 
is a homomorphism, hihs = o(91)9(92) = (gigs) and hyho € 
Im(y). Finally, if hi € Im(y), so hi = y(gi) for some gi € G, 
then hy’ = y(g,') by Lemma 2.2.2 again, so hy,’ € Im(y). 

(b) First we verify the three properties of a subgroup. By Lemma 
2.2.2, p(e) = e soe € Ker(y). If gi, go € Ker(y), then 
y(gi) = e, v(g2) = e, So, since y is a homomorphism, 
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o(gig2) = ¥(91)e(g2) = ee = €, 80 gigg € Ker(y). Finally, 
if g, € Ker(y) then y(gi) = e, so, by Lemma 2.2.2 again, 


y(g7 1) = e7! =e, so gz € Ker(y). 


Now we must show that Ker(y) is in fact a normal subgroup of 
G. We do so by verifying condition (iv(a)) of Lemma 2.3.25. Let 
gi € Ker(y), so that y(g1) = e, and let a be an arbitrary element 
of G. Then y(a~'gia) = yla")e(g)ela) = v(a)*v(m)y(a) = 
y(a)-tey(a) = y(a)-y(a) = e, so a-'gia € Ker(y), and hence 
Ker(y) is a normal subgroup of G. 


It is natural to ask whether every subgroup of H is the image of 
some homomorphism (from some group), and whether every normal 
subgroup of G is the kernel of some homomorphism (to some group). 

Here is the easy positive answer to the first of these two 
questions. 


Lemma 2.3.30. Let K be a subgroup of H. Then K is the image of 
a homomorphism. 


Proof. Let i: K — H be the inclusion, given by i(k) = k for every 
k © K. Then Im(i) = K. 


The answer to the second question is also yes, but it takes much 
more work to show it, and we do so in the following section. 
But let us conclude this section with a few more examples. 


Definition 2.3.31. Two elements g; and go of a group g are conju- 
gate if gg = a~'g,a for some a € G. Two subgroups Hy and Hy» of G 
are conjugate if Hy = a~'Hyja for some a € G. © 


Lemma 2.3.32. Being conjugate is an equivalence relation on 
{elements of G} and on {subgroups of G}. 


Proof. Straightforward. 


Remark 2.3.33. We observe that H is a normal subgroup of G if 
and only if its only conjugate is itself. % 


Example 2.3.34. 


(a) Let G = GL,(R) be the group of Example 2.3.6. Let y: G > H 
(where H = {+1} if R=Z, and H = Q—{0},R—{0}, or C—{0} 
if R= Q,R, or C) by y(A) = det(A), where det(A) denotes the 
determinant of the matrix A. y is a homomorphism by a basic 
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property of determinants: y(AB) = det(AB) = det(A)det(B) = 
~p(A)p(B). Then Ker(y) = SL,(R) (as in Example 2.3.6(b)) is 
a normal subgroup of G. 

(b) Let n > 2 and let U and L be the subgroups of G is given by: 


U = {upper triangular matrices in G} 
L = {lower triangular matrices in G} 
Then U and L are conjugate subgroups of G, so neither U nor L is a 


normal subgroup of G. We do the computations to show this in case 
n = 2, and leave the general case to the reader: 


—1 
_ 2 a b//o 1 c 0 
F | k 4 Fi | = k "| for every a,b,c € R. 


2.4 Quotient groups 


Definition 2.4.1. Let G be a group and let N be a subgroup of G. 
Then G/N is the set of left cosets of N in G. ?) 


In general, G/N is a set — nothing more. But if N is a normal 
subgroup of G, then G/N has the structure of a group. We prove 
this in stages. 


Lemma 2.4.2. Let G be a group and let N be a subgroup of G. The 
following are equivalent: 


(i) N is a normal subgroup of G. 
(ii) For any two left cosets Ly and Lz of N in G, their product LiL 
(in the sense of Definition 2.1.17) is a left coset of N in G. 


Proof. First suppose (i) is true. Let ZL; and Lz be left cosets of 
N in G. Choosing left coset representatives, we have Ly = aH and 
Ly = bH for some a,b € G. Observe that, for any subgroup N of G, 
NN = N. Then, using the fact that N is normal, and using Lemma 
2.1.18, we have that 


LL» = (aN)(bN) = (aN)(Nb) = aNNb = aNb = abN 


so LL is a left coset of N in G. 

Next suppose (ii) is true. Let a be any element of G. Let Ly be 
the left coset L; = aN and let Lz be the left coset Ly = a! N. Then 
LL is a left coset of N in G. Which left coset can it be? 
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Since e € N, the left coset Lj Lz = aNa'N contains the element 
aea~'e = e. But e € Nas N is a subgroup of G. Recalling that any 
two left cosets of N in G are either identical or disjoint (Lemma 
2.3.13), we must have L;Ly = N. That is, aNa!N = N, and so 
aNa~' CN, and then by Lemma 2.3.25, N is a normal subgroup 


of G. 


Theorem 2.4.3. Let N be a normal subgroup of G. Then G/N is a 
group. Furthermore, |G/N| = [G: NJ. 


Proof. G/N is closed under multiplication by Lemma 2.4.2. Mul- 
tiplication is associative by Lemma 2.1.18. The identity element is 
the left coset N = eN, as for any left coset L = aN, LN = (aN)(eN) = 
aeNN = aeN= aN= L and NL = eNaN = eaNN = eaN = aN = L. 
The inverse of the left coset L; = aN is the coset Lg = a 'N 
as L1L2 = (aN)\(a-!N) = aa !NN = aa !N = eN = N and 
LoL, = (a 'N)(aN) = a'aNN = a-taN = eN = N. Thus G/N 
is a group. 

By definition, |G/N| is the number of elements of G/N. But the 
elements of G/N are the left cosets of N in G, and by definition the 
number of these is [G: N]. 


Definition 2.4.4. The group G/N of Theorem 2.4.3 is the quotient 
of G by the normal subgroup N. © 


Example 2.4.5. Let G = Z and let n be a positive integer. Let 
N = nZ. Then the quotient G/N = Z/nZ is the group Z, of Example 
2.1.9. (If you go back to that example, and look at it closely, you will 
see that our construction of Z, “by hand” there agrees with our 
construction of the quotient group Z/nZ in Theorem 2.4.3.) 0) 


Remark 2.4.6. This illustrates a common theme in mathematics. 
We look at a particular situation and make a particular construction. 
Later on we look at a more general situation, and see that our par- 
ticular construction is a special case of a more general construction, 
or, viewed the other way, we see that our particular construction can 
be generalized to a (much) wider one. 0) 


Remark 2.4.7. It is impossible to overemphasize the distinction 
between a subgroup of a group G and a quotient group of a group 
G. These are two completely different animals. The elements of a 
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subgroup of G are some of the elements of G, while the elements of 
a quotient group of G are equivalence classes of elements of G, and 
these are two different things. © 


Remark 2.4.8. Again to point out the difference between subgroups 
and quotient groups, note that in Example 2.4.5, for every positive 
integer n, the subgroup nZ of Z is an infinite group, while the quo- 
tient group Z,, is a finite group of order n. © 


Remark 2.4.9. As we have seen, left cosets are equivalence classes. 
If L is a left coset of N, then choosing any representative of this 
equivalence class, i.e., any element a of L, we have that L = aN, and 
we have been using that notation in this chapter. But we have the 
notation [a] for the equivalence class of a, that we used in Chapter 
1, and we will feel free to use that notation here as well, as it is 
sometimes helpful to emphasize the fact that we are dealing with 
equivalence classes when we are considering elements of G/N. © 


Lemma 2.4.10. Let G be a group and let N be a normal subgroup 
of G. Let t: G— G/N be defined by x(a) = [a]. Then 7 is a homo- 
morphism. 


Proof. Let a,b ¢€ G. Then x(a) 7(b) = [aj[b] = (aH)(bH) = abH = 
[ab] = m(ab). 


Definition 2.4.11. The map 7: G > G/N of Lemma 2.4.10 is the 
quotient map or the canonical projection of G onto G/N. © 


We now give some more examples of quotient groups. 


Example 2.4.11. Let G = Dg as in Example 2.3.22 and let N = 
{e, a, a7} as in that example. Then N < G and [G: M = 2, so 
G/N is a group of order 2. Indeed, as we saw there, N has two left 
cosets: N itself, and {8,a8,a78}. Then G/N has two elements, [e] 
and [@], and 7: G— G/Nis given by 7(e) = (a) = 1(a”) = [e], and 
n(G) = n(aB) = 1(a7B) = [6]. Note that [6]? = [e] as 6? =e € N. 
You may well ask why did we choose [e] and [3] as the elements of 
G/N. The answer is simply that we had to make a choice, and, once 
again, any choice is as good as any other. We follow convention by 
choosing e as the left coset representative of N. But we could equally 
well choose [a3] instead of [6] as the other representative, so G/N 
would have elements [e] and [a], and [a8]? = [e] as (aG)? = e € 
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N. Similarly we could have chosen [a7] as the other representative, 
and [a2]? = [e] as (a2B)? =e€ N. 

In fact, this generalizes without change to Do,. Let N = 
{e,a,...,0°1}. Then N < G and [G: N = 2, so G/N is a group of 
order 2, G/N = {[e], [G]} with [@]? = [e] (and again we could replace 
[8] by [a*6] for any i). If n = p is prime, N is the only nontrivial 
proper normal subgroup of G. If n is composite, there are others. © 


Example 2.4.12. Let G = Qs: be the quaternion group of order 8 as 
in Example 2.1.15. G has the following four nontrivial proper normal 
subgroups: 


F=f) 
Ci = {1,4,=1,—} 
C5 = Oe Feel Pee 
Cy = {1,k, -1, —k}. 


First let us consider G/C;. Since [G: C;] = 2, G/C; is a group of 
order 2. We choose coset representatives so that G/C; = {[e], [7]} 
and we observe that [j]? = [e] as #7 = —1 € C;. Again we remark 
that this is simply a choice and we could have chosen different coset 
representatives if we had wanted to. Similarly, G/C; = {[e], [k]} with 
[kK]? = [e] and G/C, = {[e], [2]} with [7]? = [e]. Note that as abstract 
groups these are all isomorphic, and indeed are all isomorphic to Zo 
(compare Corollary 2.3.21). 

Now let us consider G/Z. Since [G: Z] = 4, G/Z is a group of 
order 4. We note that G/Z = {{+1}, {+7}, {+7}, {+k}}, so, choosing 
coset representatives, we write G/Z = {[1], [i], [7], [k]}. Now [7]? = 
[#7] = [-1] = [1] and similarly [j]? = [1], [k]? = [1]. Moreover [#][j] = 
[ij] = [&] and [jJ[é] = [ji] = [-4] =[&]. Similarly [j][&) = (AIG) = [) 
and [k][i] = [¢][k] = [yj]. Thus G/Z has “multiplication table” 


and as an abstract group G/Z is isomorphic to Zz x Zo. © 
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Example 2.4.13. Let G; and G2 be arbitrary groups and let G = 
G1 x Gy. Let Ny = G, x {e}. Then Nj is a normal subgroup of 
G, and so we have the quotient group G/N ,. Note that {(e, g2) | 
g2 € Gy} is a complete set of left coset representatives of Nj in G, 
and so G/N, = {[(e, g2)| | g2 € Go}. It is straightforward to check 
that the group operation in G/Nj is given by [(e, g2)] [(e, 94)] = [(e, 
g294)|. An entirely analogous situation holds if we let No = {e} x Gg 
and consider the quotient group G/N. © 


Here is a result we promised the reader in the last section. 


Lemma 2.4.14. Let N be a normal subgroup of G. Then N is the 
kernel of a homomorphism. 


Proof. WN is the kernel of the quotient map 7: G > G/N. 


Remark 2.4.15. Let G bea group and let N be a subgroup of G. As 
we have seen, the multiplication on G/N is given by (aH) (bH) = 
(ab) H, or equivalently [a] [b] = [ab]. But it cannot be emphasized 
strongly enough that this does not imply (a)(b) = ab. Indeed, if a’ 


is any element of aH, i.e., if [a’] = [a], and if 0! is any element of bH, 
ice., [b'] = [b], and if c’ is any element of (ab)H, i.e., [¢] = [ab], then 
[a’] [b'] = [c] but not necessarily (a’) (b') = ¢. 

As a concrete example of this, let us take G = Z, N = 10 Z, so 
G/N = Zo = {[0], [1],.-., [9]}. Then, sure enough, [3] + [4] = [7] and 
3+4=7, but [6] + [7] = [3], even though 6+ 7 43. .) 


Remark 2.4.16. This having been said, it is sometimes possible 
to choose left coset representatives so that we do have (a)(b) = ab 
whenever [a] [b] = [ab]. With some thought, we can see that this will 
be the case if and only if there is a subgroup H of G with 7: H > G/N 
an isomorphism. (We would be choosing our coset representatives to 
be elements of H.) This is not possible in the situation of Example 
2.4.5 or Example 2.4.12. But it is possible in the situation of Example 
2.4.11. We could choose H = {e, 3}, or, as we mentioned, H = {e, 
aB} or H = {e, a7}. It is also possible in the situation of Example 
2.4.13. In the case of N = G, x {e}, we choose H = {e} x Go, and 
in the case of N = {e} x Go, we choose H = G; x {e}. 

Another place in which you may have encountered a very analo- 
gous situation is that of vector spaces. Let V be a vector space and 
let U be a subspace of V. Let W be any complement of U. Then a: 
W — V/U is an isomorphism. So this situation always occurs in the 
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case of vector spaces, but is an exceptional situation in the case of 
groups. © 


We now proceed to investigate this situation, but on the way we will 
obtain some results interesting in their own right. 


Lemma 2.4.17. Let G be a group and let H, and Hg be subgroups 
of G. Then 


| H; || | 


Ay, Ho| = ———.. 
| : 2| |H, N Ay| 


Proof. Let K = H, x Hy = 1 (hx; hg) | hy € Ay, ho € Hp} 
and note |K| = || |H2|. Let ~ be the relation on K given by 
(hi, ho) ~ (hi, hh) if hihg = hihh. It is easy to check that ~ is 
an equivalence relation, and the equivalence classes of K under ~ 
are in 1 — 1 correspondence with the elements of H;H2 (where the 
equivalence class of (h1, hz) corresponds to hyhzg € HH), so the 
number of equivalence classes is |H H |. Now for any element ho of 
A, Aa, (Ris hy) = (hiho, ho tha) ~ (hy, ha) as (hi ho) (hg ‘h2) = 
hyhz. Furthermore, every (hi, h) in the equivalence class of (hi, ha) 
arises in this way, as we see from the following computation. Suppose 
hi hy = hyhg. Then hy thi, = ho(hy)~!. Call this common value ho. 
Since Hy is a subgroup, ho = een © Hj, and since Ho is a sub- 
group, ho = ho(hh)~! € Ho. Thus ho € Hi N Ho. But ho = hy th 
gives hi, = hiho and ho = ho(hh)~! gives hh = hohe. Thus every 
equivalence class has |H,MHg| elements and so we see that |H,| || 
= |HyHo| |H1 Agdl, yielding the result. 


Lemma 2.4.18. Let H, and Ho be subgroups of G with H, Hy = 
HyH,. Then H,Ho2 is a subgroup of G. In particular, if at least one 
of Hy and Ho is normal, then HH» is a subgroup of G. 


Proof. We must verify the properties of a subgroup. 
(Closure) Let hyhg € Hy Hp and hi,h, € Hy H2. Then 


(Ay hz)(hi, hy) = hy (hghi,)h = hy (hi hg) hy = (hh) (hg hy) € Hy He 


where hgh), = hhh for some hf € Hy, hy € Ha as Hy Hy = HoH. 
(Identity) e = ee € Hi Hy. 

(Inverses) Let hihg € HyH2. Then (hyhg)~! = hy'hy! € Hy He as 
Hy = HoH. 
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Now suppose that at least one of Hy and Hy is normal. We give 
the argument in case H2 is normal; the argument for H; normal is 
entirely analogous. Since H2 is normal in G, hyH2 = Hoh, for every 
h, € Hy, (in fact, for every hy € G, but we don’t need that here), by 
Lemma 2.3.25, and so H, Hy = HoH). 


Example 2.4.19. If neither H, nor Hy» is normal, then H;H2 may 
not be a subgroup of G. For example, let G = Dg. Then {e, a}{e, 
aB} = {e, a, a8, a6} is not a subgroup of G. % 


Lemma 2.4.20. Let Ny and Ng be normal subgroups of G with Nyon 
No = {e}. Then ning = nom for every nm € Ni, no € No. 


Proof. Co nyn2n; , ts On the one hand, nj nonj{ a iy a 
ni(ngnz ny 1 € N; as Nis G. On the other hand, N{NgQN{ ae a 
(ningn7') n 51 € Na as Noa G. Hence ny ngny, rie E NNN = ae 
Le, Nyngnz tig = = 6 (miner, hats) (ngn1) = e (neni), nyn2g = 
nn, as claimed. 


Definition 2.4.21. Let N; and N2 be normal subgroups of G with 
G = NiNo2 and Ni No = {e}. Then G is the direct product, G = 
N, x No of Ny and No. © 
Example 2.4.22. Let G; and G2 be groups. Then G = G1 x G2 has 
normal subgroups Ni; = G x {e} and No = {e} x Go with Ny No = 
G and N,M No = {e}. Then we have an isomorphism y: G, x Gz > 
G = N1N2 given by (91, 92) = (91, €)(€, 92). 


Example 2.4.23. Let m and n be positive integers. Suppose 
that m and n are relatively prime. Let G = Zmy. Let Ny = 


{[0], [n], [2nJ, ereting [(m aon 1)n]} and let No= {[0], [m], [2m], aa) [(n -_ 
1)m]}. Note that Ny is isomorphic to Z,, and N»2 is isomorphic to 
Zn. ‘Then G is the direct product of Ny and No. © 


Definition 2.4.24. Let H be a subgroup of G and N be a normal 
subgroup of G with G = HN and H 1 N = {e}. Then G is the 
semidirect product, G = H « N, of H and N. (We also write G = N 
x H.) rv) 


Example 2.4.25. If G is the direct product of Ny and No (as in 
Definition 2.4.21) then G is the semidirect product of Ny and N2.0 


Here is a lemma that makes it easier to check that G is a semidirect 
product. 
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Lemma 2.4.26. Let G be a finite group. Let H be a subgroup of G 
and N a normal subgroup of G with |G| = |H| |N|. If G = HN or H 
ON = {e}, then G is the semidirect product of H and N. 


Proof. This follows directly from Lemma 2.4.17. 


Example 2.4.27. (a) Let G = Dg, let H = {e, 3}, and let N = {e, 
a, a*}. Then G =H x N. 

(b) More generally, let G = Dg, let H = {e, 6} and let N = {e, a, 
uc, }. Then G=H «x N. 

(c) Let n be a positive integer. We have the following subgroups of 
GL,,(R): 

U = {invertible upper triangular matrices} 

Uo = {upper triangular matrices with all diagonal entries equal to 


1}aU 
D = {invertible diagonal matrices} C U. 
Then U = Dx Up. 0) 


Here is the most general construction of semidirect products. 


Lemma 2.4.28. Let H and N be groups and let ®: H — Aut(N) be 
a homomorphism. For h € H, let pp, = ®(h), so that yn: N—- N is 
an automorphism of N. Let G = Hx N with the group operation on 
G being given by 


(ha; m1) (ha, nz) = (hy he, Pnzi(m)na). 


Then G is a group and G is the semidirect product of the subgroup 
Hx {e} of G and the normal subgroup {e} x N of G. 


Proof. First we must check that G is a group. Because of the (com- 
plicated) definition of the group operation, this will require careful 
consideration. 

(Closure) Clear from the definition. 

(Associativity) We compute 
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(hi, m1)[(h2, m2)] (ha, m3)] = (ha, m1) (haha, Pp_z1 (n2)na) 
= (hy hgh3, (haha)? (M1) Pp, (Ma)n3) 


and these are equal. 

(Identity) The identity element of G is (e, e). 

(Inverses) The inverse of the element (h, n) of G is (h~+, yp_(n)—!) 
as 


(h,n)(h-*, pa (n)~*) = (AR, pan) yn(n)~*) = (€, €), 
(A~*, pa(n)~*)(h,n) = (ATR, Pr-1 (Grn) *)n) 
= (h*h, pr-1(pr(n)*)n) 
= (h*h,n*n) = (e, €). 


Now that we know G is a group we see that (h, e) (e, n) = (h, n) 
and also that H x {e} N {e} x N= {(e, e)}. 

It remains to show that {e} x N is a normal subgroup of G. We 
compute 


(ho, no)(e,n)(ho, no) * = (ho, no)(e,n)(hg',n1) where ny EN 
= (ho, no) (hg ',n2) where no € N 


= (e,n3) where ng € N 


as required. 


We isolate some of the salient computations. 
Corollary 2.4.29. In the situation of Lemma 2.4.28, 


(a) (hi, e) (e, m2) = (hi, n2) 
(b) (é m)(ha, €) = (ha, ,51(m)) 
(c) (h, e)(e, n)(h, e)~* = (e, pn (n)) 


Proof. These are special cases of the above computations. 


Corollary 2.4.30. Let G be a semidirect product G = H « N as in 
Definition 2.4.24. Then G can be obtained from the construction of 
Lemma, 2.4.28. 


Proof. Define 6: H — Aut(N) by ®(h) = yp where yp(n) = 
hnh-!. 
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Example 2.4.31. Let N be the cyclic group Z, = {[0],...,[n—1]} 
and let H = Zz = {[0], [1]}. Define ® by yj) ([A]) = [-A]. Then G = 
HN is isomorphic to the dihedral group D2,,. (Here we have written 
the groups additively. Switching to multiplicative notation, N = {e, 
a,...,a" 1}, H = {e, B}, and (using Corollary 2.4.30) yg(a) = 
BaB-! =aM!, giving us Don). 


Corollary 2.4.32. In the situation of Lemma 2.4.28, 
mw: Hx {e} > G/{e} x N 
defined by m(h, e) = [(h, e)] = (h, e)({e} x N) is an isomorphism. 


Proof. From the definition of multiplication in G, we see that 7 
is a homomorphism. Since {(h, e) | h € H} is a complete set of left 
coset representatives of {e} x N,7a is onto. Also, 7((h, e)) = [(e, e) 
implies h = e, so Ker(z) = {(e, e)} and hence 7 is 1 — 1. Thus 7 is 
an isomorphism. 


We now give a different perspective on semidirect products. 


Lemma 2.4.33. Let G be a group and let yp: G— Q be a homomor- 
phism from G onto the group Q. Suppose there is a subgroup H of G 
such that ply: H > Q is an isomorphism. Then G is the semidirect 
product of H and the normal subgroup N = Ker(y). 


Proof. We must show that G = HN and that HN N = {e}. 

G = HN: Let g € G and let q = (q). Since y|y: H > Q is an 
isomorphism, in particular it is onto, and so there is an element h 
€ H with y(h) = q. Then g(h-*g) = y(h')e(g) = v(h)'p(g) = 
q-'q = e,son=h'g € Ker(y) = N. But then g = hn. 

HON = {e}: Let g € HON. Since g € N = Ker(y), v(g) = e. 
Since y|y: H —> Q is an isomorphism, in particular it is 1 — 1, and 
so, since g € H with y(g) = e, we must have g = e. 


There is another way of looking at this situation. 


Definition 2.4.34. Let y: G — Q be a homomorphism. A homo- 
morphism A: Q > G is a splitting of y if pA: Q > Q is the identity. 
If y has a splitting, then y splits. © 
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Lemma 2.4.35. 


(a) Let G=H« N be the semidirect product of the subgroup H and 
the normal subgroup N. Then the quotient map 7: G— G/N 
splits. 

(b) Suppose that yp: G > Q splits. Let be a splitting of yp. Then 
G is the semidirect product G = H x N where H = X(Q) and 
N= Ker(y). 


Proof. 


(a) We observe that 7|#, the restriction of 7 to H, is an isomorphism 
nly: H + G/N. Let A = (a|z)71. 

(b) We know that N < G as N is the Kernel of a homomorphism. 
We must show G = HN and HN = {e}. 


G = HN : Let g € Gand let qg = y(g). Let h = X(q) and set n = hg, 
so g = hn. Certainly h € H. Also, y(n) = y(h-4g) = v(h)y(g) = 
y(h)*9(g) = 9(i(a)) *9(g) = gg =esoneN. 

HON = {fe}: Let g € HON and let gq = y(g). On the one 
hand, g € N so y(g) = e. On the other hand, g € H and yA: Q > 
Q is the identity, so in particular y|y (the restriction of y to H) is 
1-1. Since y(g) = e, we must have g = e. 

Thus G is the semidirect product of H and N, as claimed. 


Example 2.4.36. 


(a) For R = Z,Q, or R, let GL7(R) = {A € GL, (R) | det(A) > 0}. 
Then GL;*(R) is a normal subgroup of GL,(R) of index 2. 
For r € R, r 4 0, let sign(r) = 1 if r > 0 and sign(r) = —1 if r 
< 0. Then we have a homomorphism y: GL,(R) + {+1} given 
by (A) = sign(det(A)), and Ker(y) = GL;*(R). This homomor- 
phism y has a splitting given by 


| 


If we let H = {X(1), A(—1)} then Z is a subgroup of GL,,(R) of 
order 2 and GL,(R) = H x GLI(R). 
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(b) In this situation, suppose now that n is odd. Then vy has another 
splitting given by 


Now let H’ = {X'(1), A’(—1)}. Then we can check that H’ is a 
normal subgroup of GZ,,(R), or that every element of H’ com- 
mutes with every element of GL;‘(R). Either of these facts tells 
us that for n odd, GL,(R) is the direct product GL,(R) = 
H' x GL (R). ) 


We close with an observation. 


Remark 2.4.37. If G is abelian and G is the semidirect product of 
Hand N, then G is automatically the direct product of H and N. 


2.5 The Noether isomorphism theorems 


In this section, we will prove some basic, and widely applicable, the- 
orems about isomorphisms of groups. 


Definition 2.5.1. Let 0: G— H be a homomorphism from G onto 
HT. A homomorphism y: G > K factors through o if there is a 
homomorphism 7: H + K with y= To. 

It is illuminating to consider the following diagram: 


9 


G ———> kK 


ar 
| oes 
“7 T 


H-- 


Then y factors through o if we can fill in the dotted arrow to a 
solid arrow. .) 


Lemma 2.5.2. In the situation of Definition 2.5.1, y factors through 
o if and only if Ker(o) C Ker(y). 


Proof. First suppose Ker(a) C k(y). We define 7 as follows: Let 
h © H. Then h = o(g) for some g € G. Let 7(h) = y(g). 
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We have to see that this definition makes sense, i.e., that it only 
depends on / and not on the element g we have chosen. So suppose 
h = o(g') for some g’ € G. We need to show that y(g’) = ¢(g). 
Now e = hh™! = o(g')o(g)* = o(g')o(g*) = a(9/g"*) 80 g'g"! € 
Ker(c) C Ker(y). Thus g’g~! € Ker(y), i-e., y(g'g~') = e. But e = 
9(9'9-*) = 9(9')9(9-") = 9(9')9(g)* 80 Y(g') = (g) as required. 

Next suppose that y factors through o and let y = Ta. Let g € 
Ker(o). Then y(g) = to(g) = tT(o(g)) = T(e) = e so g € Ker(y). 
Thus Ker(o) C Ker(y). 


Theorem 2.5.3. (First isomorphism theorem) Let y: G — H 
be a homomorphism. Then Im(y) is isomorphic to G/Ker(y). 


Proof. Let 7: G— G/Ker(y) be the quotient map. Then Ker(z) = 
Ker(y) so by Lemma 2.5.2 y factors as y = Gr. 

First we claim that Im(y) = Im(¢). If h € Im(y), so h = y(q) for 
some g € G, and then h = ¢z(g) = G(a(g)) so h € Im(@). On the 
other hand, suppose h € Im(¢), so h = g(g) for some g € G/Ker(z). 
But 7 is onto, so g = 7(g) for some g € G, so h = G(m(g)) = v(g) so 
h € Im(y). Thus, if we let Hp = Im(v), then ¢: G/Ker(y) — Hp is 
onto. 

Next we claim that ¢: G/Ker(y) > Hp is 1-1. Let g € G/Ker(y). 
Then by the definition of ¢, (9) = y(g) where g is an element of G 
with a(g) = g. Thus e = (9) & e = (9g) & g € Ker(y) & m(g) =e. 

Hence g: G/Ker(y) — Ho is both 1-1 and onto, ie., is 
an isomorphism. In particular, G/Ker(y) and Hp = Im/(y) are 
isomorphic. 


Theorem 2.5.4 (Second isomorphism theorem). Let H be a 
subgroup of G and let N be a normal subgroup of G. Then H/HAN 
is isomorphic to HN/N. 


Proof. First recall that in this situation, HN is a subgroup of G. 
Then, since N is a normal subgroup of G, it is certainly a normal 
subgroup of HN. Also, it is straightforward to verify that if N is a 
normal subgroup of G, then HM WN is a normal subgroup of H. Thus 
both H/HMN and HN/N are groups. 

Let 7: G —+ G/N be the quotient map and let 7 be the restriction 
of « to H. From the fact that Ker(7) = N it is straightforward 
to check that Ker(7) = HN. Thus from the first isomorphism 
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theorem H/H ™N = H/Ker(z0) is isomorphic to Im(zo). Now 
mt: G + G/N is defined by z(g) = gN, so Im(mo) = {AN |h © FH}. 
But {AN |h € H} is just HN/N, yielding the theorem. 


Theorem 2.5.5 (Third isomorphism theorem). Let H and K 
be normal subgroups of G with kK C H. Then H/K is a normal 
subgroup of G/K and G/H is isomorphic to (G/K)/(H/K). 


Proof. Let 7x be the quotient map 7x: G > G/K and let wy be 
the quotient map my: G > G/H. Then Ker(7x) = K and Ker(7#7) = 
H so Ker(axK) C Ker(a7). Thus by Lemma 2.5.2, y factors through 
TK, 1.e., TH = 7'TK for some homomorphism 7’: G/K > G/H. 

(In other words, we have the diagram 


T 
a a 
| 
TT 


G/H 


where we have filled in the dotted arrow.) 

From the first isomorphism theorem, we know that Im(z’) is iso- 
morphic to (G/K)/Ker(z’). Thus we must identify these two sub- 
groups. 

Im(z’): We claim Im(z’) = G/H. We know that 7 is onto, and in 
the situation of a factorization, that implies 7’ is onto. (Let g € G/H. 
Then g = 7(g) for some element of G, ie., g = 1'77K(g) = 7' (7K (Qg)) 
and so g € Im(z’).) 

Ker(z’): We claim Ker(z’) = H/K. (Note this implies that H/K 
is a normal subgroup of G/K.) By the definition of the quotient 
maps, ty(g) = gH and mx(g) = gK. Then, by the definition of 
mw’, ™'(gK) = gH. Remember that the identity element of G/H 
is the left coset H. Thus gK € Ker(7’) © gH = Hoge H. 
Thus Ker(z’) = {hk |h € H}. But {hK |h © H} is just H/K, as 


claimed. 


Theorem 2.5.6 (Correspondence theorem). Let N be a nor- 
mal subgroup of G and let t: G — G/N be the quotient 
map. For any subgroup H of G with N C HA we have 
the restriction tly: H — H/N. This gives a 1-1 correspon- 
dence between S, = {subgroups of G containing N} and Sy = 
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{subgroups of G/N}. Furthermore, this restricts to a 1-1 correspon- 
dence between T, = {normal subgroups of G containing N} and 
T2 = {normal subgroups of G/N}. 


Proof. Clearly if H € S; then m|q(H) € So. Also Hy C Hy & 
T\H,(41) C a\H,(H2) and similarly Hy C Hy © a|y,(He) C 
T\H,(H1) so Hy = He & a\y,(Mi) = a\H,(H2) and so the map 
from 5S to S> is 1-1. Also, if H € S2 then, setting H = 7~'(H), we 
have |47(H) = H so the map from Sj to Sz is onto, and hence we 
have a 1-1 correspondence between $; and So. 

Now suppose H € 7}, so that H is a normal subgroup of G. 
Then for any a € G, aHa-! = H. Thus for any aN € G/N, 
(a/N)(H/N)(aN)~! = (aHa!)/N = H/N so H/N is a normal 
subgroup of G/H, ie., H/N € T 2. Conversely, if H/N € Th, 
so that H/N is a normal subgroup of G/N, let m : G/N -— 
(G/N)/(H/N) be the quotient map. Also, we have an isomorphism 
i:(G/N)/(H/N) > G/H. Let 2’ = inyr: G > G/H be the compo- 
sition. Then H = Ker(z’) is a normal subgroup of G, ie., H € T}. 
Thus we also have a 1-1 correspondence between 7) and T». 


2.6 The structure of finite, and finitely generated, 
Abelian groups 


In this section, we first completely determine the structure of finite 
abelian groups, and then completely determine the structure of infi- 
nite, but finitely generated, abelian groups. Along the way, we shall 
prove some results of interest in themselves. 

Since we will only be dealing with abelian groups in this section, 
we will write all groups additively. Also, if G= G, x Go is the direct 
product of G; and Go, we will write G = G; ® Go and call G the 
direct sum of Gy, and Gp. 

We begin with finite groups. 


Theorem 2.6.1 (Cauchy’s theorem for abelian groups). Let 
G be a finite abelian group of order n, and let p be a prime dividing 
n. Then G has an element of order p. 


Proof. Let n = pk. We prove the theorem by complete induction 
on k. If k = 1, then n = p. We have proved (Corollary 2.3.21) that in 
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this case G is cyclic. Let a be any element of G except a = e. Then 
a has order p. 

Now for the inductive step. Let G have order n = pk with k> 1. 
Choose any element a of G other than a = e. Then a has order 
m for some positive integer m with m > 1 and m dividing pk (by 
Corollary 2.3.20). 

There are two possibilities: 


(i) p divides m. In that case, let b = (m/p)a. Then 6 is an element 
of G of order p. 

(ii) p does not divide m. In that case, let H be the subgroup gener- 
ated by a. Then H has order m. Since G is abelian, H is normal, 
so G/H isa group. Let 7: G > G/H be the quotient map. Then 
|G/H| = n/m is still divisible by p, so |G/H| = pj for some j < 
k. By the inductive hypothesis, G/H has an element 6 of order 
p. Let c be any element of G with z(c) = b. Then the order of 6 
divides the order of c, so c has order pq for some gq. Let d = qe. 
Then d is an element of G of order p. 


Remark 2.6.2. Cauchy’s theorem is true for arbitrary finite groups, 
not just abelian ones. We have proved it here in the abelian case 
because this case is simpler. The proof in the general case requires 
new ideas. But we will prove it in general later (as Theorem 2.9.1). © 


Lemma 2.6.3. Let G be an abelian group and let a and b be elements 
of G of orders m and n respectively, with m and n relatively prime. 
Then c= a+ 6 is an element of G of order mn. 


Proof. First observe that (mn)c = (mn)(a+ 6) = (mn)a+(mn)b = 
n(ma) + m(nb) = n(0) + m(0) = 0. Thus c has order k for some 
k dividing mn. Now 0= ke = k(a+ 6) =ka+kbso kb = k(—a). 
Now the order of k(—a) divides the order of —a, which is the order 
of a, which is m, and the order of kb divides the order of 6, which is 
n. Hence the order of kb = k(—a) divides gcd(m,n). But m and n are 
assumed to be relatively prime, so gcd(m,n) = 1, and kb = k(—a) has 
order 1, i.e., kb = k(—a) = 0. But then, since k(—a) = 0, m divides 
k, and since kb = 0, n divides k. Again, m and n are assumed to be 
relatively prime, so mn divides k. Hence k = mn, as claimed. 
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Remark 2.6.4. This lemma is really Theorem 2.2.15 “in disguise” 
but rather than showing how it is equivalent to that theorem, we 
decided to prove it directly, for the sake of simplicity. © 


Theorem 2.6.5. Let G be a finite abelian group and let e be the 
exponent of G, i.e., the least common multiple of the orders of the 
elements of G. Then G has an element of order e. 


Proof. Factor e as e = pu pe a pak where p1,..., Dz are distinct 
primes. Then G has an element a; of order pi qi, for some q;, for each 
i= 1,...,k. Then qa; has order pe, for each i. Since pe, dae pe 
are pairwise relatively prime, we see, applying Lemma 2.6.3 induc- 
tively, that b = qya,1 + qoag +--- + qzay is an element of G of 
order e. 


Corollary 2.6.6. Let G be an abelian group of order n. The following 
are equivalent: 


(i) G ts cyclic. 
(ii) For any integer d dividing n, G has exactly d elements whose 
order divides d. 
(iii) For any integer d dividing n, G has at most d elements whose 
order divides d. 


Proof. Let e be the least common multiple of the orders of the 
elements of G. From Corollary 2.3.20, we see that e divides n. There 
are two possibilities: 

e = n: In this case, by Theorem 2.6.5, G has an element of order 
n, so G is acyclic group of order n, i.e., G is isomorphic to Z,. Thus 
(i) is true. It is straightforward to check that all the subgroups of 
Zn are as follows: Let d be any integer dividing n. Then there is a 
unique subgroup of Z,, of order d, whose elements are {[0], [n/d], 
[2(n/d)],...,[(d—1)n/d]}, from which (ii) and (iii) follow. 

e # n: In this case G is not cyclic (as if G were cyclic, it would 
have an element of order n) which would imply that e = n. Thus (i) 
is false. But ea = 0 for every a € G, so G has n > e elements whose 
order divides e, and (ii) and (iii) are false as well. 


We will now investigate the structure of Z,, @ Zp in general. 
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Theorem 2.6.7. Let G = Zm ® Zn with m and n positive inte- 
gers. Let g = gcd(m,n) and l= Iem(m,n). Then G is isomorphic to 
Z, 8 Zyq- 


Proof. Denote the element (1, 0) of Z,, ® Z, by a and the element 
(0, 1) of Z,, @ Z, by b. Let y = a + b. If ky = 0, since G is the direct 
sum of the subgroup generated by a and the subgroup generated by 
b, we must have ka = 0 and kb = 0, and this will be true if and only 
if both m and n divide k, i.e., if and only if / divides k. Thus y is an 
element of G of order /, so y generates a cyclic subgroup N, of G of 
order /, i.e., isomorphic to Z,. 

Write m = gm’ and n = gn’. Then m’ and n’ are relatively prime, 
so there are integers s and ¢t with m’s + n/t = 1. Let z = —m’sa + 
n' tb. 

You may well ask where does this choice of z come from? Note 
that we have a symbolic matrix equation 


T= Lens wel La] 


and if we let A be this 2-by-2 matrix, A has determinant 1. This 
implies that A is invertible over the integers, i.e., that A~! is a matrix 
with integer entries. We will crucially use this fact soon. 

We claim that z is an element of G of order g. To see that, 
first observe that gz = g(—m’sa) + g(n'tb) = —s(gm'a) + t(gn'b) = 
—s(ma) + t(nb)= —s(0) + t(0) = 0, as a has order m and b has 
order n. Thus if z has order r, then r divides g. But suppose that 
rz = 0. Then r(—m'sa) + r(n’tb) = 0. Since G is the direct sum 
of the subgroup generated by a and the subgroup generated by 8, 
we must have r(—m’sa) = 0 and r(n’tb) = 0, or (m’rs)a = 0 and 
(n'rt)b = 0. Since a has order m and 6 has order n, we must have 
that m = m'g divide m’rs and that n = n’g divides n’rt, and so g 
divides rs and g divides rt. But, since m’s + n't = 1, s and ¢ are 
relatively prime, so that implies that g divides r. Hence r = g. Thus 
z generates a cyclic subgroup N, of order g, i.e., isomorphic to Zy. 

We claim that G is the semidirect product of N, and Nz. Since G 
is abelian, this semidirect product is automatically the direct product 
(Remark 2.4.36), or direct sum, as we are calling it here, G = N, © 
N,, isomorphic to Z; ® Zz. 
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To see this we must verify that G = N, + N-, (we are writing the 
group additively) and that Ny, M Nz = {0}. 

G = N, + N;: Referring to the above symbolic matrix computa- 
tion, we see that 


ee le 
b z m's 1 z| 
Le, a = n'ty —z and b = m'sy + z. 

Thus a and 6 are both in N, + N,. Since Ny + Nz is a subgroup 
of G, and a and b generate the group G, we must have G = Ny + 
Nz. 
Actually, at this point we are done. Note |G| = mn = lg = |Ny| 
|N.| so by Lemma 2.4.26, G is the semidirect product of Ny and 
N-., as claimed. 

But, although it is not logically necessary, it is illuminating to 
verify independently that N, 1 Nz = {0}, so we shall do so. 

Ny 1 Nz = {0}: Let « € N,N Nz. Then x = jy for some j and 
x = kz for some k. Thus jy = kz and so j(a + 6) = k(-—m'sa + n’tb). 
Once again, since G is the direct sum G = Z,, © Z, of the subgroup 
generated by a and the subgroup generated by b, we must have ja = 
k(—m' sa) and jb = k(n'tb), i.e., (j + km’s)a = 0 and (j — kn't)b = 0. 
Since a has order m and b has order n, we must have that m divides 
j + km's and that n divides j — kn’t. Since g divides both m and n, 
we have that g divides both 7 + km’s and j — kn’t, and hence that 
g divides their difference (j + km’s) — (j — kn’t) = k(m's + n't) = 
k(1) =k. Since g divides k, and z has order g, we must have kz = 0. 
Thus x = 0 (and jy = 0 so | divides j) as claimed. 


Remark 2.6.8. Theorem 2.6.7, and its proof, are valid for any m 
and n. But in case m and n are relatively prime, then / = mn and 
g = 1. In this case the first paragraph of the proof (again) yields 
that Zn, © Zy is isomorphic to Zp». But if we go through the whole 
proof, we find that Z,, @ Zy is isomorphic to Zmn 6 Z1. But Z, is 
the trivial group, so Zmn © Z 1 is isomorphic to Zmy. © 


Example 2.6.9. 


(a) Let N be a positive integer. Then any factorization of N as 
N=mn with m and n relatively prime gives an isomorphism of 
Zn with Zm ® Zp. For example, if N = 30, we have that Zao 
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is isomorphic to Zi5 © Zo, Zio © Z3, and Ze © Zs. (Z30 is also 
isomorphic to Z39 @ Z1, but again that is trivial since Z is the 
trivial group.) 

(b) Let N be a positive integer and let N = mn. Then any factor- 
ization of N as N = m’n’ with ged (m’, n’) = gcd(m, n) and 
lem(m’, n’) = lem(m, n) (in fact, these two conditions are equiv- 
alent) gives an isomorphism of Z, © Zp with Zm: ® Zp. For 
example, if N = 4320 = 180 - 24 = 120 - 36 = 360 - 12, Zig9 © 
Za is isomorphic to Zj29 @ Zs and Z369 © Zy2. © 


In preparation for proving the first main theorem of this section, 
we derive a result that we will use in that proof. 
We begin with a definition and a result that should remind 


you very much of vector spaces — indeed, the situation here is 
entirely analogous. 
Let Z” = Z x--- x Zwhere there are n factors. Let e1,...,€n be 


the elements of Z” given by e; = (1, 0,...,0), eg = (0, 1,...,0),..., 
en = (0, 0,...,1). Note that every element a = (a1,...,@,) of Z” can 
be written as a = aye, +--: + Gpnen and this expression is unique. 


Definition 2.6.10. An abelian group G is free abelian group of rank 
n if G is isomorphic to Z”. If a: G + Z” is any isomorphism, let 
bh, = a~(e1), bg = a (Gp) ,00« 50% = a7 l(en). Then 1 Upc n a Oat 
is a basis of G. © 


We thus see that any element g € G can be written uniquely as 
g = 4161+ --- + Qnby, for integers a1,...,a,. Indeed if {b1,...,b,} is 
any set of elements of G with this property, we have an isomorphism 
a: G > Z” given by 


a(a,by +--+ nbn) = (Q1,---, An). 


Now recall that if V is a vector space with basis {v1,...,Un} and 
{w1,...,Wn} are any elements of a vector space W, there is a unique 
linear transformation T: V > W with T(v;) = w; for each 7. Here 
is the analog for abelian groups. 


Lemma 2.6.11. Let G be a free abelian group of rank n and let 
{b,...,0n} be a basis of G. Let H be an arbitrary abelian group 
and let {h1,...,hn} be elements of H. Then there is a unique group 
homomorphism yp: G + H with p(b;) = h; for each i. 
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Proof. First let us consider Go = Z”. 

We start out by defining yo: Go — H “coordinatewise”. That is, 
we let yo(e1) = A1,.-.-,o(en) = An. Then, if a € Go, a= (a1,...,an), 
then a = ay€j+ +--+ + den, so we let 


yo(a) = a yo(e1) +++> + GnYo(en) = aihy +++: + anhn. 


Then yo: Go — H is well-defined, as the expression for a in terms 
of {e€1,...,€n} is unique, and it is straightforward to check that yo 
is a homomorphism. 

Now let G be a free group of rank n, let a: G — Go be an 
isomorphism, so that b; = a~'(e;) for each i=1,...,n. Then we let 
p= Poe. 

Our proof of the lemma was a little indirect. We could have 
directly worked with G without going through Gog. But we have done 
things this way to more directly lead to our next result. 


We now consider the group Zm, X --: X Zm,,, and we let o: Z” > 
Zm, X +++ X Zm,, be the homomorphism 
O(a1,-+-54n) = ([@i}mrs+++> [4nlmn)- 
We let 7 
€1 = a(e1) = ([L]ms OR ? [O]mn)s 
en = o(€n) = ([O]m, [D}ma2, ) [1] mn) 


We observe that Ker(o) = (mZ) x --+ xX (m)Z). 


Lemma 2.6.12. Let G be a group that is isomorphic to Go = 
Zim, X++:X Zm,, and leta: G— Go be an isomorphism. For each 


i =1,...,n, let b; = a! (&). Let H be an arbitrary abelian group 
and let {hj,...,hn} be elements of H. Suppose that the order of h; 
divides m; for each i=1,...,n. Then there is a unique group homo- 


morphism p: G— H with p(b;) = h; for each 1. 


Proof. By Lemma 2.6.11, if Go = Z”", we have a unique group 
homomorphism yo: Go — H. Now the condition that the order of h; 
divides m; for each i gives us that yo (miei) = myo(e;) = mh = 
0 for each i, so Ker(yo) D (mZ) x --- xX (m,Z) = Ker(oc). Then 
by Lemma 2.5.2, yo factors through a, i.e., yo = yo for some homo- 
morphism y: G — H, and this is the desired homomorphism y. 
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Now we have y(b;) = h; for each i, and since every element of 
G can be expressed as a,b; + --- + G@nbp, for some aj,..., Gn, this 
determines y uniquely. 


In thinking about the proof of Lemma 2.6.12, you may wonder 
why we didn’t simply define y: G > H by the formula y(a,b, + 
+++ + dybn) = ayhy +--+ + Qnh, as we did in Lemma 2.6.11. The 
point is that in Lemma 2.6.11, this expression is unique, giving us 
an unambiguous definition of y, but in Lemma 2.6.12 it is not. So 
we had to have some method of showing that the potential ambi- 
guity in fact doesn’t matter. We set things up so that we could use 
Lemma 2.5.2 to do so. 

Now we come to the first main result of this section, the struc- 
ture theorem for finite abelian groups. This tells us that every finite 
abelian group G is isomorphic to the direct sum of cyclic group Ze, 
® Zep B+: GB Ze,. We would like to conclude that these integers 
€1, €2,.--, €, are unique, giving us a classification of finite abelian 
groups, but we have already seen that they are not (compare Exam- 
ple 2.6.9). We can readily see one potential source of ambiguity: We 
do not want to allow any e; to be 1, as that would give us a trivial 
Z, summand, which we could eliminate without changing the iso- 
morphism class of G. (You can think of this as being analogous to 
factoring positive integers. We would not want to consider 2-3, 2-3-1, 
2-3-1-1,...to be distinct factorization of 6.) But even without that, 
Example 2.6.9 shows there is a lot of other ambiguity. However, if 
in Example 2.6.9 we exclude e; = 1 then in (a) we have the unique 
expression Z39. If we also require in (b) that eg divides e; then we 
have the unique expression Z3¢69 © Zy2. It turns out that the analog 
(or, more precisely, the generalization) of this condition is enough to 
give us uniqueness. 

But this is really step 2 of our main result. Step 1 is to prove 
existence, that G is isomorphic to the direct sum of cyclic groups, 
and afterwards we will prove uniqueness. However, this is one of the 
(not uncommon) situations in mathematics where it turns out to be 
easier to prove a stronger (in our case, more specific) result. Rather 
than trying to show that G is isomorphic to some direct sum of 
cyclic groups, it is easier to show that G is isomorphic to a direct 
sum of cyclic groups whose orders satisfy our condition. So that is 
what we shall do. 
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Theorem 2.6.13 (Structure theorem for finite abelian 
groups). Let G be a finite abelian group. If G is the trivial group, 
then G is isomorphic to Ze, for ej = 1. Suppose that G is nontrivial. 
Then there are positive integers €,,...,e€, (for some k) with e, #1, 
and, ifk > 1, with ej41 dividing e; fori =1,...,k—1, such that G is 
isomorphic to Ze, ®-:-PLZe,. Furthermore, the integers €1,...,€, are 
unique. 


Proof. We first prove existence. We proceed by complete induction 
on n = |Gl. 

If n = 1 then G is isomorphic to Z, and there is nothing further 
to show. Assume now that the theorem is true for all abelian groups 
of order less than n and let G have order n. 

Let e be the exponent of G. If e = n, then by Theorem 2.6.5, G 
has an element a of order n, so in this case G is cyclic of order n, 
i.e., G is isomorphic to Z,, and we are done. 

Suppose that e < n. Set e; = e. By Theorem 2.6.5, G has an 
element a, of order e;. Let H, be the subgroup generated by ay. 
Then H, is isomorphic to Ze,. 

Now consider the group Q = G/Hj. This is an abelian group of 
order n/e1, so by the inductive hypothesis there is an isomorphism 
yp: Q + Ze. B-::PZe, where e, ~ 1 and if k > 3, e41 divides e; 
for i= 2,...,k—1. Let bo € Q with y(b2) = (1, 0,...,0),...,b. € Q 
with y(b;) = (0, 0,...,1). Now the quotient map 7: G —> Q is onto, 
so there are elements a},...,a}, of G with 1(a4) = be,...,7(a,,) = bp. 

Now if a5 had order e2,..., a), had order ex, we would be in busi- 
ness, as we shall see below. But there is no reason to believe that is 
the case, and it may not be, so we will have to modify these elements. 

Fix i > 2 and consider aj. m(a,) = b; and 6; has order e;. Then 
m(e,a,) = en(at) = e,b; = 0. But 7(e;a;) = 0 means e;a, € Hy. Thus 
ea, = k,a, for some k;. Now by the definition of e, = e, e; divides 
e,. Then (e;/e;)(e;a,) = (e1/e;)kia1, ie., era, = (e1/e;)kja,. Now 
the order of any element of G, and in particular of a}, divides e1, so 
e1a, = 0. Thus we see that (e/e) (kja1) = 0, ie., that ka, is an 
element of H; whose order divides e;/e;. But Hj, is a cyclic group of 
order e1, generated by a,, and we know exactly what those elements 
are. They are the elements of the subgroup generated by e;a,. Thus 
we must have that k; = ej; for some integer 4;. 

Assembling this, we see that e;a, = e;j;a1, or that e;(a), — j,a1) = 
0. Thus if we set a; = ai — j,a1, we see that e;a; = 0, so the order 
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of a; divides e;. But m(a;) = m(ai) = 6; has order e, so e; divides 
the order of a;. Hence a; has order e;, and this is true for every 
i=2,...,k, 

Now let H»2 be the subgroup of G generated by ag,...,ax. By 
Lemma 2.6.12. there is a homomorphism A: Q + Ho2 with A(bj) = 
fis 1 = QLeceg ks Them aAb;) = bgt = 2yeoeg hy 60 TA GQ > OQ is 
the identity. In the language of Definition 2.4.34, A is a splitting of 
am. Hence, by Lemma 2.4.35, G is the semidirect product of H, and 
Hy. Since G is abelian, G is then the direct product of H; and Hg 
(Remark 2.4.37), or, as we are calling it here, the direct sum G = Hy 
© Hy». But then, since H, is isomorphic to Z.,, and H2 is isomorphic 
to Ze, ©: PB Ze,, G is isomorphic to Ze, B Ze, O-:: PB Ze, 

Hence, by induction, we have the existence of a direct sum decom- 
position for every finite abelian group G. 

Now we must show that, under our conditions, the integers 
€1,.-.,€; are unique. Thus, suppose G is isomorphic to Ze, ®---PZe, 
and also to Zp, ®---®Zyz, with both ej,...,e, and f1,...,f satisfying 
the conditions of the theorem. We must show that | = k, and then 
that f; = e; for i=1,...,k. 

We begin with an observation that we will use crucially. Let s 
and t be positive integers. Then the number of elements of the cyclic 
group Z, whose order divides ¢ is equal to gcd(s, t). In particular, if 
s divides t the number of these elements is s and if ¢ divides s the 
number of these elements is ¢. Also, if t = p is prime, the number of 
these elements is p if p divides s and is 1 if p does not. 

With this observation in hand, we go to work. Since & and / are 
positive integers, we many as well assume that | < k. (Otherwise 
switch the e’s and the f’s.) Now e; > 1, so is divisible by some prime 
p. Then, since e, divides e; for every 7 < k, by the above observation 
we see that G has p* elements whose order divides p. But we can 
also compute this number from the second decomposition, and we 
see it is p" where I’ is the number of f1,...,f; that are divisible by p. 
Certainly I/ < 1. Counting the number of elements of G whose order 
divides p in two different ways must yield the same result, so p!” = 
p® and hence I' = k. Thus k < J, so we must have | = k. 

Now we must show that e; = f; for each i= 1,...,k. We proceed 
by downward induction on 7. We begin with 1 = k. Let us count 
the number of elements of G whose order divides e,. From the first 
decomposition we see it is ae From the second decomposition we see 
that it is the product gcd(f1,e,)gcd(f2,en)... gcd(f;,e,). Thus, since 
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again the two counts must yield the same result, we must have that 
ecd(fx,ek) = ex, i.e., that e, divides f, (and then automatically gcd(fi, 
e.) = ey for every 1=1,...,k). Reversing the roles of f; and e,, we 
see that gcd(ex, fr) = fk, i-e., that f, divides e,. Hence f, = ex. 

Now for the inductive step. Suppose that fp, = ex, fe-1 = 
€k—1,---Ji+1 = 41 and consider e; and f;. Let us count the number 
of elements of G whose order divides e;. From the first decomposi- 
tion we see that this number is the product et €j41---€k while from 
the second decomposition (using the inductive hypothesis) it is the 
product gcd(f1,e;)gcd(fa, 6)... gcd(fi,e;)ei41 -.. ex. Again these two 
answers must be the same, so in particular we must have gcd(fj;, e;) = 
e;, i.e., that e; divides f;. And again we can reverse the roles of e; 
and f; to conclude that gcd(e;, f;) = fi, i.e., that f; divides e;. Hence 
fi = Gj. 

Then by induction this is true for every i = 4&,...,1, 
and we are done. 


Definition 2.6.14. Let G be a finite abelian group. 
The integers e1,...,e, as in the conclusion of Theorem 2.6.13 are 
called the elementary divisors of G. © 


Before proceeding further, we should observe an important point 
about the proof of Theorem 2.6.13. The best way to make this obser- 
vation is by looking at an example. 

Let G = Z4 ® Zo. Of course, this group is given to us as a direct 
sum of cyclic groups, but suppose we didn’t notice that and wanted 
to prove that it was. So we follow the strategy of the proof of Theo- 
rem 2.6.13. This group has exponent 4, so we begin by choosing an 
element a; of order 4, say a, = (1, 0). Now we look at the quotient 
of G by the subgroup generated by a;, which is isomorphic to Za, 
a cyclic group of order 2, and look for an element a2 of G of order 
2 that projects to the element of order 2 in this quotient. Such an 
element is a2 = (0, 1), and we’re done. But all we know is that we can 
find an element whose projection has order 2, so we might instead 
have chosen a4, = (1, 1), which is an element of G of order 4, which 
doesn’t work. But the proof of Theorem 2.6.13 shows that we can 
“fix” this element by letting a2 = a — a; = (0, 1), which does work. 
Note that what was crucial in being able to fix aj was that a, was 
an element of maximal order. 
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But suppose we hadn’t done things this way, and just had started 
out picking any element. If we had started by choosing a; = (0, 1), 
of order 2, then we could have chosen ag = (1, 0), of order 4, and 
our proof would have gone through. But instead, suppose we had 
started by choosing a; = (2, 0), also an element of order 2. Then we 
would have been stuck. There would have been no way of choosing 
az of order 4 so that G would be the direct sum of the subgroup 
generated by a; and ag, as if ag is any element of G of order 4, aq is 
an element of the subgroup of G generated by ag. (We leave this for 
you to check.) Thus our strategy of choosing an element of maximal 
order is what guarantees that our proof indeed does work. 

By the way, although the choices of a; = (1, 0) and az = (0, 1) 
are the “obvious” choices for G = Z4 © Zo, these are not the only 
choices. Indeed, our proof shows that we can make any choices for a 
and ag, providing that these elements have the right orders (4 and 2 
respectively). Thus, for example, we could have equally well chosen 
oy = {1 1), or a= (2, 1). 

There is a second kind of decomposition of a finite abelian group 
G as a direct sum of cyclic groups that is often more useful. 

We begin with the case of G cyclic. 


Theorem 2.6.15. Let G be a cyclic group of order n > 1. Let n = 
dy di ; ae ee 

py... py’ be the prime factorization of N, with p,...,pp distinct 

primes. Then G is isomorphic to the direct sum Za Be: @ Zak: 
1 k 


Proof. By induction on the number of distinct prime factors k of n. 
If k = 1 there is nothing to prove. 
The crucial case is k = 2. Suppose n = pip, Since p" and p? 
are relatively prime, we know that Z, is isomorphic to Z a ® Z da 
al; 2 


Now suppose the theorem is true for k — 1 and let n = p" Sed pt. 
Then pa and the product pe ae pik are relatively prime, so by the k 


= 2 case Z, is isomorphic to Z pil eZ pi Pe But then by the k — 1 


case, Z do. nit is isomorphic to Lae Be::- OZ pik» SO G is isomorphic 

go 2 Pr 
to Z da e::-@Z tk and by induction we are done. 
5 


Pr 


Before proving our next general theorem, let us return to Example 
2.6.9 and reexamine it from this perspective. 
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Example 2.6.16. 


(a) Let G = Zo. Since 30 = 2 - 3 - 5, we see that Zo is isomorphic 
to Zz @ Z3 © Zs. But we could regroup the right hand side as 
(Z3 ® Z5) ® Zz, so Z30 is isomorphic to Z15 @Ze, or as (Zo © 
Zs) ® Z3, so Z3q is isomorphic to Z19 © Z3, or as (Zg © Z3) @ 
Zs, so that Zo is isomorphic to Zg @ Zs. 

(b) Let G = Zigo ® Zoq. Since 180 = 4- 9 - 5, Zigo is isomorphic to 
Za © Zg ® Zs, and since 24 = 8 - 3, Zq is isomorphic to Zg © Zs. 
Thus G is isomorphic to (Z4 © Zg © Zs) © (Zg © Zz). Regroup- 
ing terms on the right hand side as (Zg © Z3 © Zs) © (Z4 © Zo), 
we see that G is isomorphic to Z129 © Z36, and regrouping them 
as (Zg ® Zo © Zs) © (Z4 © Z3), we see that G is isomorphic to 
Z369 © Zy2. 0) 


Here is a more general decomposition theorem. 


Theorem 2.6.17. Let G be an abelian group of order mn, with m 
and n relatively prime. Let Ym: G— G by Ym(g) = mg, and let yn: 
G > G by gn(g) = ng. Let Hn = {g € G| order of g divides m} 
and H, = {g € G| the order of g divides n}. Then 


(2) Ba = Ker(ay) = Im(G,_) and Hy, = Ker(y,) = Imlep,). 


Proof. 


(a) By definition, Hy», = Ker(ym). We must show Ker(y,,) = Im(yn). 
We show this by showing that each is contained in the other. 
Im(yn) C Ker(ym): Let h € Im(y,,). Then h = mg for some g € 
G. But then y,(h) = gn(mg) = (mn)g = 0 as the order of any 
element g of G divides |G| = mn. 

Ker(ym) C Im(y,): Since m and n are relatively prime, there 
are integers a and b with am + bn = 1. Now let g € Ker(y,), 
so that mg = 0. Then g = lg = (am + bn)g = a(mg) + n(bg) 
= a(0) + n(bg) = yn(bg). 

The second claim in (a) is proved the same way, switching the 
roles of m and n. 

(b) We must show H,,H, = {0} and G = H,, + H,. Again choose 
integers a and 6 with am + bn = 1. 

Am O Hy = {0}: Let g € Hm OM An = Ker(yn) N Ker(ym). Then 
g = 1g = (am + bn)g = a(mg) + b(ng) = a(0) + b(0) = 0. 
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Hm + Hn = G: Let g € G. Then g = lg = (am + bn)g = 
m(ag) + n(b9) = Ym(ag) + Yn(b9). 


Example 2.6.18. 


(a) Returning to Example 2.6.16(a), we see that example is an illus- 
tration here as well. 

(b) Returning to Example 2.6.16(b), we see that G = Zigq ® Zag is 
a group of order 4320 = 2° - 3° - 5. Writing 4320 = 2° - (3° - 5), 
we obtain that G is isomorphic to (Zg © Z4) © (Zo © Zz @ Zs); 
writing 4320 = 3° - (2° - 5), we obtain that G is isomorphic to 
(Zo ® Z3) © (Zg ® Za © Zs); writing 4320 = 5 - (2° - 3°), we 
obtain that G is isomorphic to Z5 ® (Zg ® Z4 ® Zo © Zs). 


Remark 2.6.19. In Example 2.6.18, we know the structure of G 
exactly. But the point of Theorem 2.6.17 is that we don’t have to. 
For example, let G be a group of order 96 = 32 - 3 whose structure 
we don’t know. Then we can conclude that G = H32 @ H3, where 
H39 is the subgroup of G consisting of those elements of G whose 
order divides 32 and H3 is the subgroup of G consisting of those 
elements of G whose order divides 3. Similarly, if G is any group of 
order 240 = 16 - 15, there we can conclude that G = Hig © As, 
where H1¢ is the subgroup of G consisting of those elements of G 
whose order divides 16 and Hj5 is the subgroup of G consisting of 
those elements of G whose order divides 15. 

Also, we have put Theorem 2.6.17 here as it serves as a lead-in to 
our next result. By we wish to observe that this theorem does not 
depend on Theorem 2.6.13. Its proof is completely independent of 
the proof of Theorem 2.6.13, and in fact much easier. © 


Here is our next general result. (To preclude confusion, let us 
specifically state that we allow the set S in Theorem 2.6.20 to have 
repeated entries.) 


Theorem 2.6.20 (Alternate structure theorem for finite 
abelian groups). Let G be a finite abelian group. If G is the triv- 
ial group, then G is isomorphic to Z,. Suppose that G is nontriv- 
ial. Then there is a set S = {q,...,q} of positive integers, each of 
which is a prime power, such that G is isomorphic to Zg, ®-:: B Zq,.- 
Furthermore, the set S is unique. 
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Proof. If G is the trivial group, the theorem is certainly true. Sup- 
pose not. 

By Theorem 2.6.13, G is isomorphic to Ze, © --- ® Ze, with all 
e; > 1. Factor each e; into a product of powers of distinct primes: 


— dir diji 
C1 = Pi Pay 


_ dri dkjk 
Ck = Pry Dk ; 


Then by Theorem 2.6.15, 


Ze, is isomorphic to Z aaa ees dia 
11 1jl 


Ze, is isomorphic to Za, @::- OZ ayy, 
Pri Pkik 


and so G is isomorphic to 


(z a1 0: OZ an) O-:-@ (z de O': OZ me 
Pu 4jl Pri Prik 
and S = {Ons 34 Pers ae pak see's 2 \. 

Now we must show that S$ is unique. 

We see from the above construction that the elementary divi- 
sors €1,...,@, determine S. We shall show that, conversely, the set 
S determines the elementary divisors. Then, since the elementary 
divisors are unique, S is unique. 

Here is an algorithm for recovering the elementary divisors from 
S. Set S; = S. Let i = 1. 

For any value of i, let R; be the subset of S$; consisting of, for every 
prime, an element of S; of the highest power of that prime. Let e; be 
the product of the elements of R;, and let S;,,; = S; — R;. Change 
i to i+1 and iterate this process. (Note that this process will stop 
as S is finite and we remove at least one element from S; at every 
step. In fact, it will stop at step k, yielding e,, when k is the largest 
integer such that, for some prime p, there are k powers of p in S.) 


Definition 2.6.21. Let S be the set in the conclusion of Theorem 
2.6.20. In case G is the trivial group, let S be the empty set. Then 
the elements of S are the invariant factors of G. © 
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We have just proved the structure theorem for finite abelian 
groups. 

Now we wish to generalize this to finitely generated abelian 
groups, which we first define. 


Definition 2.6.22. Let G be an abelian group. A set A = {2;} of 
elements of G generates G if every element g of G can be expressed as 
g = 44n;x; for some integers {n,;}. If there is a finite set of generators 
for G, then G is called finitely generated. The minimum number of 
elements in a generating set for G is called the rank of G. © 


Example 2.6.23. 


(a) Any finite abelian group is finitely generated, as we may let 
A= G. 

(b) If G = {0}, G has rank 0. (By definition, the value of the empty 
sum is 0.) 

(c) If G is a nontrivial cyclic group, then G has rank 1. 

(d) If G is a finite abelian group with elementary divisors e1,...,¢, 
as in Definition 2.6.14, G has rank k. 

(e) If G = Z”, then G is finitely generated. We see right away that 
G has rank at most n. With considerably more work we will see 
that G has rank exactly n. © 


Actually, our statement that we will generalize the structure the- 
orem from finite abelian groups to finitely generated abelian groups, 
while true, is in a way misleading, as the key to our analysis will be 
to consider the diametrically opposite case, finitely generated abelian 
groups that have no elements of finite order other than the identity. 
Once we have done that, it will be easy to combine the two situations. 

A lot of what we will be doing should very much remind you 
of linear algebra, but, as you will see, the situation is much more 
involved here. Indeed, the proofs here turn out to be rather subtle. 

Let us introduce some standard terminology. 


Definition 2.6.24. An abelian group G is torsion-free if it has no 
elements of finite order other than the identity. © 


Now let us make some definitions that should remind you a lot 
of linear algebra. 
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Definition 2.6.25. Let G be an abelian group and let A = {2;} be 
a set of elements of G. 


(a) A is linearly independent if the equation 4;n;x; = 0 only has the 
solution n; = 0 for all 7. 

(b) A spans G if the equation 4;n;x; = g has a solution for every g 
€ G. 

(c) A is a basis of G if A is linearly independent and spans G. 


Note that “spans” is a synonym for “generates”. The word spans 
is usually used in linear algebra, while the word generates is usually 
used in group theory. 

Of course, we know that every vector space has a basis. We will 
prove, with a lot of work, that every finitely generated torsion-free 
abelian group has a basis. 

We do linear algebra over fields, e.g., the field Q. Here we will be 
doing analogous arguments over the integers Z. But you should not 
think that everything just goes through the same way — it does not. 
Here is a simple example that already illustrates the difference. 


Example 2.6.26. Recall the following theorem from linear algebra. 


Theorem. Let V be a vector space and let B be a set of elements 
of V. The following are equivalent: 


(i) B is a basis of V (i.e., B is linearly independent and spans V). 
(ii) B is a maximal linearly independent set of vectors in V. 
(iii) B is a minimal spanning set of vectors in V. 


Now let us think about G = Z. We could choose B = {1}. Then, 
indeed, B is a basis of G, and is a maximal linearly independent set 
of elements of G, and is also a minimal spanning set of elements of G. 

But if we choose B = {2}, then B is a maximal linearly inde- 
pendent set of elements of G, but B does not span G (and so is not 
a basis of G). And if we choose B = {2, 3}, then B is a minimal 
spanning set for G, but is not linearly independent (and so is not a 
basis of G). » 


Definition 2.6.27. An abelian group G is free if it has a basis. 


Note that if G has a basis {z1,...,2,} then we have an iso- 
morphism «: G —> Z* defined as follows: Let g € G and write 
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g = Yjnjx;. (Since {x1,..., 2} is a basis of G, every element g of G 
can be written in the form in a unique way.) Then «(g) = (m1,...,nx)- 
Thus we see that our definition here of a free abelian group agrees 
with our previous definition, Definition 2.6.10. 


Remark 2.6.28. Note that no element x of G of finite order can be 
linearly independent, as for such an element x we have nx = 0 for 
some n # 0. Similarly, if G has any elements of finite order other than 
the identity, then G cannot possibly have a basis, so G cannot be 
free. 7) 


Here is a result that is exactly analogous to the situation in vec- 
tor spaces. 


Lemma 2.6.29. Let G be an abelian group of rank k. Then any set 
S of more than k elements of G is linearly dependent. 


Proof. By induction on k. This is trivial if & = 0, so assume k > 
0. Since G has rank k, by definition it is spanned by some set A = 
{r1,..-,%%} of k elements of G. 

Let S = {y1, y2,...} be a set of more than k elements of G. 

The case k = 1: If yy = 0 or yo = 0 then S is linearly dependent. 
Otherwise, since {x} spans G, y; = ax, for some a € Z, a #0, and 
yo = bx,, for some b € Z, b #0. But then by; — ayg = 0. 

Now suppose the result is true for k — 1. Let H be the subgroup 
of G spanned by {21,...,%%-1}. 

If S C H, we are done by induction. Suppose not. For each 3, let 


Yi=Yitrix, where y; € A and 1; is an integer. 


(This expression for y; may not be unique. Choose any one.) At 
least one 7; #4 0. Assume it is ry. 

Suppose S = {y1, yo, ...} is linearly independent. Then {y1, 
r1Y2, T1Y3, ---} is also linearly independent. (If nyyi1 + no(riy2) + 
n3(riys) +--+: =0, then niyi + (nari)y2 + (n3ri)y3 +--+ = 0.) 
Then {y1, riy2 — reyi, T1y3 — T3y1 .--} is linearly independent. 
(If niyr + no(riye — royi1) + n3(riy3 — r3y1) + --- = 0, then 
(m1 — narg — n3r3...)y1 + (mari)y2 + (n3ri)y3 +--: = 0.) Then 
T = {riya — rey2, T1y3 — T3y1, ---} is linearly independent (being 
a subset of a linearly independent set). But for i > 1, 


neta =i (a Pee) RT ee) KH a EB, 
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as y; © Handy, € H. Thus T is a linearly independent set of more 
than & — 1 vectors in H; contradiction. 


This lemma has a consequence that is also analogous to the situ- 
ation in vector spaces. 


Corollary 2.6.30. Let G be a free abelian group with a basis B con- 
sisting of k elements. Then k = rank(G). In particular, any basis of 
G consists of rank(G) elements. 


Proof. By definition, the rank of G is the smallest number of ele- 
ments in any generating set B for G. Since B is a generating set, we 
must have rank(G) < k. But we cannot have rank (G) < k, as then, 
by Lemma 2.6.29, B would be linearly dependent. 


Now we have a technical lemma we will use later. 


Lemma 2.6.31. Let G be an abelian group and let S= {2,...,r%} 
be a finite set of elements of G. Let H be the subgroup of G spanned 
by S. Let A be any k-by-k matrix that is invertible over Z, and let 


Y1 Seal 
=A 
Yk Lk 


Let H' be the subgroup of G generated by S’ = {y,,.--, yx}. Then 
H' = H. Furthermore, S is linearly independent if and only if S' is 
linearly independent. 


Proof. Since H’ consists of those elements of G that can be 
expressed as linear combinations of the elements of S$’, and every 
element of S$’ can be expressed as a linear combination of the ele- 
ments of S, we see that every element of H’ can be expressed as a 
linear combination of the elements of S, and so H’ C H. 

But also 


Uk Yk 


so by the same logic H C H’, and hence H’ = H. 
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Now suppose S$ is linearly independent. Consider the equation 
Eby: = (0). We can write this equation as 


Yk 
and then 


[by «++ bg] | A 
| [a 
Baal 
[[b1 - + - bx] Al =0 
Lk, 


Since S is linearly independent, we must have [b;...b,] A = 0. 
But A is invertible, so we must have [;...b,] = [0...0], and thus we 
see that S’ is linearly independent. By the same logic, if S’ is linearly 
independent then so is S. 


Lemma 2.6.32. Let G be an abelian group of finite rank, and let 
A = {a;} be any finite generating set of G. Then a subgroup H of G 
has finite index in G if and only if there is an integer N € 0 such 
that Nx; € H for every 4 € A. 


Proof. First of all, note that the condition in the lemma is equiv- 
alent to the condition that for each x; € A there is an integer N; 4 
0 such that N;x; € H. To see this, let A = {2x1,...,2,}. If for some 
x; there is no nonzero integer N; with N;x; € H, then there is cer- 
tainly no nonzero integer N with Na; € H for every i. On the other 
hand, if there is such an integer N; for every x;, we could take N 
to be any nonzero integer that is divisible by Ny1,...,Nz, e.g., N = 
Icm(Njy,...,Nz,). (In fact, it is easy to check that any integer N that 
is not divisible by lem(N1,...,N,) will not have this property.) 
Now suppose H has finite index d in G. Then for each x;, we may 
consider the elements {0, 2;, 2%;,...,dz;}. These cannot be in distinct 
cosets of H in G, as there are d + 1 such elements. Thus we must 
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have two distinct elements of this set, say mz; and nz;, in the same 
coset, in which case, if N; = (n— m), Nia = (n— m)a; © H. 

On the other hand, suppose there is such an integer N. Then any 
expression Ena; with each n; divisible by N gives an element of 
H. We thus see that we have representatives of all of the cosets of H 
in G given by 


k 
{Sone 0 <n; < N for each 1 : 
i=1 

These representations may contain duplications, but whether or 
not they do the number of element in this set is N*, so H has at 
most N* cosets in G, and hence is a subgroup of finite index (in fact, 


index at most N*) in G. 


Theorem 2.6.33. Let G be a free abelian group of rank k. 


(a) Let C = {y1,..., yn} be a set of k element of G that spans G. 
Then C is linearly independent, and hence C is a basis of G. 

(b) Let C = {yi,..., yn} be a set of k element of G that is linearly 
independent. Let H be the subgroup of G spanned by C. Then H 
is a subgroup of G of finite index, and hence C' is a basis of a 
subgroup of G of finite index. 


Proof. Let B= {2x1,...,a,} be a basis of G. 
(a) Since B spans G, we have 
k 
Yi= Sty for some integers {pjj}, for each j= 1,...,k. 
j=l 
Similarly, since C spans G, we have 
k 
Ly = Ss aay for some integers {qi;}, for each j= 1,...,k. 
j=l 
Let P = (pj;;), the k-by-k the matrix whose entry in position (4;) is 
pij, and similarly let Q = (qj;). Then 


Yk Xk Lk Uk 
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But B is a basis of G, which means that every element of G can 
be expressed in a unique way as a linear combination of the elements 
of B, so we must have QP = IJ, the identity matrix. In particular, the 
matrices P and Q are invertible. 

Since B is a basis, it is linearly independent, and then we have 
that C is linearly independent by Lemma 2.6.31. 

(b) Again let us write 


k 


Yix= or for each i= 1,...,k, 
j=l 


and form the matrix P = (pj;;) as in part (a). 
Now B is a basis of G, so every element of G can be written 
uniquely as 


k 


) r,a;, for integer r1,...,Tp. 
j=l 


We define a new group Gg by 


k 
Go= {oseils <q}. 


i=1 


Since these expressions for the elements of G were unique, this gives 
us unambiguous expressions for the elements of Gg. Then Gg is an 
abelian group containing G. Not only that, but we can multiply any 
element of Gg by any rational number f. Thus, we see that Gg has 
the structure of a vector space over Q. 

We claim that B is a vector space basis for Gg. It certainly spans 
Gg, so we must show it is linearly independent over Q. So suppose 
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we have an expression 


k 
\> fei =0 with f, €Q. 


i=1 


Write f; as a quotient of two integers, f; = 7;/s;, for each i= 1,...,k. 
Then 


k 
S>( (3) =O. 


1=1 


Let s be the product s = s,... ,s,. Then, multiplying this equation 
by s, we obtain the equation 


k 
Ss" ( Tr 15)5;)) oe = 0. 


c=]. 


Now s; divides s, for each 7, so every coefficient r;(s/s;) is an 
integer. By hypothesis B is linearly independent over Z, so each of 
these coefficients must be 0, so each r; must be zero, and so each f; 
must be zero, as required. 

Thus we see that Gg is a Q-vector space which has B = {z1, 

, Z } as a vector space basis, so in particular it has dimension k. 

Now consider the set C. We are assuming that C is linearly inde- 
pendent over Z, so, by the same argument as we gave above for B, 
we can conclude that C is linearly independent over Q. 

Thus, C is a linearly independent set of k elements in the vector 
space Gg of dimension k, so C’ is a vector space basis of Gg. Thus C 
spans Gg and so we have 


k 
i= >> qjyj for some rational numbers{q,;}, for each i= 1,...,k. 
j=l 


Again we let Q = (q;). We note that P is a k-by-k matrix with 
entries in Z, and Q is a k-by-k matrix with entries in Q. We argue 
as before: 
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SO 


1 Ty 


Uk, Uk, 


and once again B is a basis, so QP = J, and P and Q are invertible 
matrices over Q. In particular, Q = P~!. Now Q is a matrix with 
rational coefficients, but we can write any such matrix as Q = (1/s0) 
Q' where Q’ has entries in Z and sg is some nonzero integer: Just 
take sg to be the least common multiple of the denominators of all 
of the entries of Q. 

Then 


so 
iq ‘A — ‘d 
so} i |=Q Pi j,, : | =Q"]: 
= | | a 
In other words, if H is the subgroup of G spanned by C = 


{y1,---,Yk}, 80% € H for each i = 1,...,k. But then by Lemma 
2.6.32, H is a subgroup of finite index in G. 


Theorem 2.6.34. Let G be a free abelian group of rank k. Then any 
subgroup H of G is a free abelian group. 


Proof. This is certainly true if H = {0}, so assume H # {0}. 
We prove this by induction on k. 
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If k = 1, G is isomorphic to Z, i.e., G is an infinite cyclic group, 
and we have already shown, in Lemma 2.3.8 that a nontrivial sub- 
group H of an infinite cyclic group is itself infinite cyclic. (If G is 
generated by an element x71, then H is generated by na, for some 
nonzero integer 7.) 

Now suppose the theorem is true for any free abelian group of rank 
k —1, and let G have rank k. Let B = {2,...,%} be a basis of G. 

Let J be the subgroup of G generated by {x1,...,2,~1} and let K 
be the subgroup of G generated by {z,}. Note that J is free abelian 
of rank k —1, K is free abelian of rank 1,and G=J 0 K. 

We have a group homomorphism 7: G — G given by 


(ny 21 tee ft Np_12p_1 + Nee) = NX, +++ + Np_1LR-1 


and we observe that Ker(7) = K and Im(7) = J. Let y be the 
restriction of 7 to the subgroup H, so that y is given by the same 
formula, but is only defined for elements of H. Let J; = Im(y) and 
K, = Ker(y). Then J; C J and Ky C K (more precisely, K, = H 
geo? 

Since J; C J, and J is free abelian of rank k — 1, we have, by the 
inductive hypothesis, that J; is a free abelian group (of some rank j 
< k-1 by Lemma 2.6.29). 

If kK; = Ker(y) = {0}, then y: H > J is an isomorphism and 
hence H is free abelian. 

Suppose Ky; # {0}. Let {z1,...,z;} be a basis of J; and let 
{y1,---,yj} be elements of H with p(y) = 4, i = 1,...,7. Then, 
by Lemma 2.6.11, there is a homomorphism A: J; > H with A(z) = 
yi, += 1,...,7. In the language of Definition 2.4.34, A is a splitting 
of y, and so by Lemma 2.4.35, H is isomorphic to J; @ Ky. (Note 
our argument here is very similar to the proof of Theorem 2.6.13.) 
But K, C K is free abelian by the k = 1 case. Thus H is isomorphic 
to Z) ® Z, i.e., H is free abelian of rank j + 1. 


Now we come to our second main result. 


Theorem 2.6.35. Let G be a finitely generated torsion-free abelian 
group. Then G is free. 


Proof. First we shall show that G contains a subgroup of finite 
index that is a free abelian group, and then we will use a “trick” to 
show that G itself is a free abelian group. 
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Let A = {21,...,2,} be any finite set that generates G. Choose 
a maximal linearly independent subset B of A. Renumbering the 
elements, if necessary, we may assume that B = {21,...,2;}. Let H 
be the subgroup generated by B. Then 4 is free abelian (as B is a 
basis of H). We claim that H is a subgroup of G of finite index. 

If 7 = k, then H = G. Suppose not. Then for each i= 7+1,...,k, 
the set {21,...,2j;, a} is not linearly independent, so we have a 
relation 


I 
(> br] + a;x; = 0 for some integers b/,... , bj 
n=1 and some integer a; with a; 4 0. 


But that gives us that a;7; € H for each i= j+1,...,k. Of course, 
la; = a € H for each i= 1,...,7. But then, from Lemma 2.6.32 (see 
in particular the first paragraph of the proof), H is a subgroup of G 
of finite index. 

Now for the trick: Again by Lemma 2.6.32 there is some nonzero 
integer N such that Na; € H for every x; € A, and since every element 
z of G can be written as 7 = nx, + +--+ + Np, we see that Nx € 
H for every x € G. In other words, if we define y: G > G by (az) = 
Nz, then Im(y) C H. Now y is 1-1 precisely because G is torsion- 
free: Nx = 0 implies « = 0. Thus g is an isomorphism onto its image 
G; = y(G). But G, is a subgroup of the free abelian group H (and H 
has finite rank; indeed we must have that the rank of A is at most k, 
again by Lemma 2.6.29), so, by Theorem 2.6.34, G1 is a free abelian 
group; since G is isomorphic to G1, G is free abelian as well. 


We can simply assemble some of the results we have proved to 
state a slightly sharper result. 


Corollary 2.6.36. Let G be a free abelian group of rank k. Then any 
subgroup H of G is free abelian of rank j < k. Furthermore, j = k if 
and only if H is a subgroup of G of finite index. 


Proof. By Theorem 2.6.35 we know that H is free. By Lemma 
2.6.29 we know that 7 < k. 

If 7 = k, we know from Theorem 2.6.33(b) that H is a subgroup 
of finite index. 
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Suppose that H is a subgroup of G of finite index. We apply the 
“trick” in the proof of Theorem 2.6.35: G is isomorphic to a subgroup 
G, of H. But then G, also has rank k, and by Lemma 2.6.29 again 
we know that k < j, soj =k. 


Remark 2.6.37. Note the hypothesis in Theorem 2.6.35 that G 
be finitely generated. This theorem is false in general without this 
hypothesis. For example, G = Q is a torsion-free abelian group that 
is not free. v) 


We have previously handled finite abelian groups, and we have 
just handled finitely generated torsion-free abelian groups. It is easy 
to put these together to obtain a final structure theorem. 


Definition 2.6.38. Let G be an abelian group. Its torsion subgroup 
Gtor is the subgroup of G defined by 


Gtor = {x € G| xis an element of finite order}. © 


It is easy to check that Gj, is a subgroup of G. Note, however, 
that if G has both elements of infinite order and nontrivial elements 
of finite order, {sc € G | z is an element of infinite order} U {the 
identity} is never a subgroup of G. For example, if G = Z ® Za, 
(1, 0) is an element of infinite order, (—1, 1) is an element of infinite 
order, but their sum (1, 0) + (-1, 1) = (0, 1) is an element of finite 
order (order 2). 


Theorem 2.6.39 (Structure theorem for finitely generated 
abelian groups). Let G be a finitely generated abelian group. Then 
G is the direct sum T ® F, where T is a finite abelian group and 
F is a finitely generated free abelian group. More precisely, T is the 
subgroup Gtor of G and F is isomorphic to the quotient group G/Gtor. 


Proof. Let 7 be the quotient map 7: G > G/Gior = Q. Observe 
that T = Ker(z) = Gor. Then Q is torsion-free, and is finitely gen- 
erated (as any set that generates G generates Q). Thus, by Theorem 
2.6.35, Q is free. But then, just as in the proof of Theorem 2.6.34, 7 
has a splitting A: Q > G. Let F = A(Q). Then G= T 6 F. 


Corollary 2.6.40. Let G and G’ be finitely generated abelian groups 
(possibly G! = G). If G is isomorphic to T @ F, T finite and F free 
abelian, and G' is isomorphic to T ® F’, T' finite and F’ free abelian, 
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then G and G' are isomorphic if and only if T and T' are isomorphic 
and F and F' are isomorphic. 


Proof. Any isomorphism y: G — G’ must restrict to an iso- 
morphism between Gj, and G’to,, and hence give an isomorphism 
between G/ Gio, and G'/G' tor. 

On the other hand, if T and T’ are isomorphic, and F' and F” are 
isomorphic, then G and G’ are certainly isomorphic. 


Corollary 2.6.41. A finitely generated abelian group G is deter- 
mined up to isomorphism by the elementary divisors, or invariant 


factors, of Gtor and the rank of G/Gtor-. 


Proof. This data determines Gio, and G/Gio, up to 
isomorphism. 


2.7 Applications to number theory 


In this section we apply our knowledge of finite groups to prove some 
results in number theory. These results are interesting and important 
in themselves, and we will also be using them later (in Section 3.8). 

Throughout this section p will denote a prime. 

Recall that Z, is the quotient group Z, = Z/pZ and, following 
our previous notation, if x,y € Z then [x], = [y]p if and only if «= 
y (mod pZ). We will follow universal practice and write this relation 
as x = y (mod p). We recall that Z, = {[0],,[1],,---,[p — 1],} and 
that p — 1 =-—1 (mod p). 

We let G = Zi = {[l]p,---,[2 — 1p} with group operation 
[m]p[n]p=[mn],. Note we are writing G multiplicatively. Since we 
will exclusively be considering elements and subgroups of G in this 
section, we will write all our groups multiplicatively. 

Also, we will abbreviate [i], to [¢] throughout this section. 


Theorem 2.7.1 (Fermat’s little theorem). Let p be a prime, 
and let a be an integer relatively prime to p. Then a?~' = 1 (mod p). 


Proof. Consider [a] € G. Since G is a group of order p—1, the order 
of [a] divides p—1. Thus [a]?~! = [1] in G. But [a]?~+ = [a?—1], so 
[a?—"] = [1] in G, ie., a? -' = 1 (mod p). 
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The following easy number-theoretic lemma turns out to 
play a key role. 


Lemma 2.7.2. Let x and y be integers. Then a? = y” (mod p) if and 
only if = +y (mod p). In particular, x2 = 1 (mod p) if and only if 
x =+1 (mod p). 


Proof. Certainly if s = y (mod p) then x? = y? (mod p), and if 
x = —y (mod p) then 2? = (—y)? = y” (mod p). 

Conversely, suppose that 2? = y? (mod p). Then p divides 2?—y? 
= (x—y)(x+y). Since p is a prime, p must divide one of the factors. 
If p divides x—y, then z = y (mod p), and if p divides x + y, then 
x =—y (mod p). 

The second conclusion of the lemma is the special case y = 1. 


Theorem 2.7.3 (Wilson’s theorem). Let p be a prime. Then 
p-l 
(p—1)!= II n = -—1 (modp). 
n=1 
Proof. If p= 2, then 1! = 1 = —1 (mod 2). 
Suppose that p > 2. Then 


p-1 p-2 

(e-1)) = [= (Im) [p—l] eG. 
n=1 n=2 

Let us examine the product inside the parentheses. We make two 

observations. Let [n] = [jy], 2 < 7 < p—2. Since G is a group, [j] has 

an inverse [j]~' = [k] for some k. We observe: 


(1) Since [1]-1 = [1] and [p—1]—! = [p—1] (since [1]? = [1] and 
[p—1]? = [1]), [k] # [1] and [A] A [p—]]. 

(2) [k] A [j]. For if [k] = [j], then we would have [1] = [/][k] = [j]?, 
ie., 7? = 1 (mod p), and that cannot happen by Lemma 2.7.2. 
Thus we may group the p—3 terms in this product into (p—3)/2 

pairs {[j], [j]~1}. Now the product of terms in each pair is [1], so 


this product is [1](°—%)/2 = [1], and then the entire right-hand side 
is [1][1][p—1]. Thus [(p—1)!] = [p—1] in G,ie., 


(p —1)!=p—1=-1(modp), 


as claimed. 
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Our goal in the remainder of this section is to investigate 
quadratic residues and nonresidues (mod p). These are defined as 
follows. 


Definition 2.7.4. Let a be an integer relatively prime to p. Then a 
is a quadratic residue (mod p) if there is an integer b with 6? = a 
(mod p). Otherwise, a is a quadratic nonresidue (mod p). 

Equivalently, let a be an integer relatively prime to p. Then aisa 
quadratic residue (mod p) if [a] = [b]? for some [b] € G. Otherwise, 
a is a quadratic nonresidue (mod p). 

We call [a] € G a quadratic residue/nonresidue according as a is 
a quadratic residue/nonresidue (mod p). .) 


At this point we are faced with a choice. Our goal is to 
prove Euler’s theorem (Theorem 2.7.12). There are two ways to 
reach this goal. 

The first way is purely by group theory. We will show, just using 
group theory, that a certain subgroup of G is cyclic, and that is 
enough for us to be able to derive Euler’s theorem. 

But in fact, the group G is cyclic. We can’t prove this just using 
group theory, but we will be able to prove it using some (easier) group 
theory and some ring theory, and that will easily give us Euler’s 
theorem (see Section 3.8). 

So we have a choice. You can read the group theory proof now, 
or the group/ring theory proof later. But as usual in mathematics, 
“or” is inclusive — if you wish, you can do both! 


Corollary 2.7.5. Let p be a prime, and let a be an integer relatively 
prime to p. Then a'?—))/? = +1 (mod p). 


Proof. Let b = a®—/?, Then b? = (a@-)/?)? = a?-! = 1(mod p) 


by Fermat’s little theorem. But then 6 = +1 (mod p) by 
Lemma 2.7.2. 


Lemma 2.7.6. For any odd prime p, there are (p—1)/2 quadratic 
residues and (p—1)/2 quadratic nonresidues in G. 


Proof. Let y: GG be the homomorphism defined by y({k]) = 
[k]? = [k?]. By definition, Ry = {quadratic residues in G} = Im(y). 

By the first isomorphism theorem (Theorem 2.5.3), R 4 is iso- 
morphic to G/Ker(y). Now Ker(y) = {[k] € G|[k]? = [1]}. By 
Lemma 2.7.2, Ker(y) = {[1], [p—1]}, a subgroup of G of order 2. Thus 
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|R4| = [G: Ker(y)] = |G| /|Ker(y)| = (p—1)/2. If R- = {quadratic 
nonresidues in G}, then R- = G- R,, so # (R_) = # (G) — # 
(Ry) =@-D-@-D/2=(@-V/2. 


Remark 2.7.7. Note that Ri is a subgroup of G and that R_ isa 
coset of R_. .) 


Now, in the notation of the proof of Lemma 2.7.6, let us con- 
sider the quotient group G/R ,. This is a group of order | G|/|R+| 
= (p—1)/((p—1)/2) = 2, so must be isomorphic to Zz. Following 
common mathematical practice here, we write this group multiplica- 
tively, as {+1}. 


Definition 2.7.8. With the above identification, we let yp: 
G—{+1} be the quotient map, so that x,([a]) = 1 if [a] is a quadratic 
residue in G and x,({a]) = —1 if not. The homomorphism x, is called 
the quadratic residue character. 

For an integer a relatively prime to p, we define the Legendre 


symbol (4) to be (<) = xp([a]), so that (4) = 1 if a is a quadratic 


residue (mod p) and (<) = -—1 if a is a quadratic nonresidue 
(mod p). ) 


Remark 2.7.9. We are doing violence to mathematical history here. 
In the development of mathematics (at least on planet earth) number 
theory preceded group theory, so the Legendre symbol was defined 
long before the quadratic residue character was. 

But in a way this is part of our point here. Looking back on 
number theory from the point of view of group theory, we can see 
how group theory both enriches and clarifies number theory. © 


Lemma 2.7.10. Let a and 6 integers relatively prime to p. Then 
aia =) 
p) \p p)- 


(i) If a and 6 are both quadratic residues (mod p), then ab is a 
quadratic residue (mod p). 

(ii) If one of a and 6 is a quadratic residue (mod p) and the other 
one is a quadratic nonresidue (mod p), then ab is a quadratic 
nonresidue (mod p). 


In other words: 
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(iii) If a and 6 are both quadratic nonresidues (mod p), then ab is a 
quadratic residue (mod p). 


Proof. This is just the statement that yp: G—o{+l} is a 
homomorphism. 


In the statement of the next lemma, we will write G = Hox x 
H,, rather that Hy: 6 Hy as we are writing G multiplicatively, and 
similarly in its proof. Thus, in particular, we will write the elements 
of a cyclic group as powers of a generator, and denote the identity 
element of any group by e. 


Lemma 2.7.11. Let p — 1 = 2*n where n is odd, and write G = 
Hox x Hy, as in Theorem 2.6.14. Then the subgroup Hox of G is cyclic. 


Proof. Suppose not. Then by either of our two structure theorems 
(Theorem 2.6.13 or Theorem 2.6.20), Hs is isomorphic to a direct 
product of at least two cyclic groups, each of whose orders is a power 
of 2. So in particular Hj. has a subgroup K isomorphic to Ze x Zo 
with e, f > 1. Let y: K — Zoe x Zo; be an isomorphism. 

Let a be a generator of Zye and let b be a generator of Zos. Of 
course, (e, e) is the identity in Ze x Zys, so has order 1. But notice 
that (e,e)2 = (e,e), (a ',e)? = (e,e), (e,b%*)? = (e,e), and 
(a2°"~",b2/~")? = (e, e). Thus we see that Ze x Zor has at least (and 
in fact exactly) 4 elements of order dividing 2. 

But then Ho., and hence G, has at least 4 elements of order 
dividing 2, namely y7!((e,e)), 9-1 ((a" ',€)), pe H(e,b” *)), and 
y!((a2"*,b””~")). But this is impossible by Lemma 2.7.2. 


Now we arrive at our goal. 


Theorem 2.7.12 (Euler). Let p be an odd prime. For any integer 


a relatively prime to p, a?—-)/? = () (mod p). 


Proof. Write G = Hox x H,, as in Lemma 2.7.11. Choose a gen- 
erator x of Hjx. Then G = { (x', y) |\C24g <P Ha. ge eh 
Let y:G—+G by y([m]) = [m?]. Note that y(Hj.) C Ho. and 
y(H,) C Hy. Also note that y|H,: Hy, > Hy is an isomorphism, 
since n is odd. (The argument for this is very much like the argu- 
ment in the proof of Theorem 2.6.17.) 
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As we have observed, Im(y) = Ry = {quadratic residues in G}. 
We claim Im(y) = S = {(2', y) | 0 < i < 2*—-1 with i even, y € 
H,,}. Consider such an element (x', y) with i even. Since y|H;, is an 
isomorphism, y = y(z) = z for some z € A,,. But then 


(2,2) = ((2)",.27) =i(iehy) 


Thus $ C Im(y). On the other hand, consider any element (z’, y) 
of G. Then (2, y) = (x74, y?). To be precise, this exponent is only 
defined (mod 2*). But since 2* is even, any i with i = 27 (mod 2*) 
must also be even. Thus Im(y) C S and these two sets are equal, 
i.e., Ss = Ry. 

Now let g = [a] € G. We have two possibilities: 


(1) g € Ry. Then on the one hand, 3) = 1 by the definition of the 
Legendre symbol. On the other hand, g = y(f) for some f € G 
and then g?—)/2 — (f2) PV? = fP-! =e €G, or in other 
words 


[aJP-D/? — Jae-0/2) = [1] € G, ie., a®-Y/? = 1 (modp), 


and the theorem is true in this case. 
(2) g € R_. Then on the one hand, ($) = —1 by the definition 


of the Legendre symbol. On the other hand, g = (a’, y) with i 
odd. (Again, this exponent is only defined (mod 2"), but since 
2" is even, any i’ with 7’ = i (mod 2*) must also be odd.) Now 
(p—1)/2 = 2*-'n so gG? VP = Car cama But 7” = 
1 so g@-D/2 = igre): Now « has order 2” and 2*~lim is 
not a multiple of 2* (as i and n are both odd) so a '™ Ze and 
hence g(?~))/2 4 e. But then by Corollary 2.7.5 


[a]@-D/2 — Jae-/2) = [-l1] € G, ie, a®-)/2 = —1 (modp), 


and the theorem is true in this case as well. 


Corollary 2.7.13. Let p be an odd prime. 


(a) Ifp = 1 (mod 4), then —1 is a quadratic residue (mod p). 
(b) Ifp = 3 (mod 4), then —1 is a quadratic nonresidue (mod p). 
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Proof. By Euler’s theorem, (+) = (-1)@-YP (mod p). If 


p =1 (mod 4), (p—1)/2 is even, and if p = 3 (mod 4), (p—1)/2 is 
odd. 


This tells us the quadratic character of —1. Next we want to find 
the quadratic character of 2. We first prove a general result, and then 
apply it to easily determine that. 


Lemma 2.7.14 (Gauss’s lemma). Let p be an odd prime and let 
a be an integer that is relatively prime to p. Let 


S = {i|1<i< (p—1) /2 and [ai] = [k] for somek 
with 1 < k < (p—1) /2}, 
T = {t|1<i< (p—1)/2 and [ai] = [k] for somek 
with (p+1)/2<k<p-l}. 
Let s = #(S) and t = #(T). Then a is a quadratic residue (mod p) 
if t is even, and a is a quadratic nonresidue (mod p) if t is odd. 
Proof. Write [m,] = [ai] if 1 © S and [n,] = [ai] if 2 € T. 
Of course, SUT = {i|1 <i < (p—1) /2} and Sn T = 9. That 
is, {S,T} is a partition of the set {1,...,(p—1)/2}. 
We first compute 
[ted [[ ed = [led [] le 
ics icT icS = eT 


(p—1)/2 
= [TI w= TL ta 
ie SUT i=l 
(p—1)/2 
=[al?-P? JT fd) = 2? - 1)/2)!1. 
i=1 
Next, we claim {[m,]} U {[p—n]} = {1,...,(p—1) /2}. Since 
the two sets on the left-hand side have s + t = (p—1)/2 elements, 
if suffices to show they are disjoint. Suppose not. Then [m] = 
[p —nj] = [—n,] for some i and j, ie., [ai] =—[aj], or ai + aj = 
a(t + 7) is divisible by p. Since p is a prime, it must divide one 
of the factors. Now p does not divide a, by hypothesis, so p must 
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divide i + j. But this is impossible as 1 < i, 7 < (p—1)/2, soi + j 
is between 1 and p—1. 


Thus 
(p—1)/2 
((p-)/2)= TT =] [led [fn 

i=l ieS i€T 

= | [ld [ [led 
ieS i€T 

= {(-1)] [led [lr 
ieS i€T 


[ [led [[led = (-)) 1 - 1)/2)!)- 


Comparing these two computations, and using Euler’s theorem, we 


(3) breton 


yielding the result. 


Corollary 2.7.15. Let p be an odd prime. 


(a) Ifp=1 or7 (mod 8), then 2 is a quadratic residue (mod p). 
(b) If p =3 or 5 (mod 8), then 2 is a quadratic nonresidue (mod p). 


Proof. Let p = 8k + r with r = 1, 3, 5, or 7. Let a = 2 
in Gauss’s Lemma. 

If r=1, T= {2k+1,...,4k} has 2k elements. 

If r= 3, T= {2k+1,...,44+1} has 2k+1 elements. 
If r= 5, T= {2k+2,...,4k+ 2} has 2k+1 elements. 
If r=7, T= {2k+ 2,...4k+ 3} has 24+2 elements. 


Corollary 2.7.16. Let p be an odd prime. 


(a) Ifp=1 or 3 (mod 8), then —2 is a quadratic residue (mod p). 
(b) If p = 5 or 7 (mod 8), then —2 is a quadratic nonresidue 
(mod p). 
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Proof. This follows immediately from Corollary 2.7.10, 2.7.13 and 
2.7.15. 
What about a for other values of a? This is the subject of the 


famous Law of Quadratic Reciprocity, which we prove in Section 3.8. 


2.8 Actions of groups on sets 


In this section, we discuss the action of groups (both abelian and 
nonabelian) on sets, an important and useful topic in its own right, 
and one which will play a crucial role in proving the results of the next 
section. 

Let X be set. Recall from Example 2.1.13 that Aut(X) = 
{bijections o: X — X} is a group under the operation of compo- 
sition of functions. We often denote this group by Sx, and call it the 
symmetric group on the elements of X. It is easy to check that if we 
have a bijection, i.e., a 1-1 correspondence, f: X > Y, then Sx and 
Sy are isomorphic; indeed y: Sx > Sy by y(c) = faf~' is an iso- 
morphism. Thus if X is a nonempty finite set, with n elements, then 
X is isomorphic to the symmetric group on {1,2,...,n}, we denote 
this group by $,. We recall that 5, is a group of order n!. 

Here is the basic definition. 


Definition 2.8.1. An action of a group G on a set X is a homo- 
morphism ®: G— Aut(X). 0) 


This is a cryptic definition, so let us expand on it to see what it 
means. Let yy = ®(g). Then Im(®) = {y, | g € G} is a group of 
automorphisms of X (as it is a subgroup of Aut(X), which is the 
group consisting of all automorphisms of X), ie., {yg | g € G} isa 
set of automorphisms of X satisfying: 


(i) ve = id: X > X, 
(ii) Yon = Yon for any g, h € G, 
(iii) ya = (yy) for any g € G. 
We will see some examples — in fact, we have already seen some 


examples, without giving them this name — but we make another 
definition first. 


Definition 2.8.2. An action of a group G on aset X as in Definition 
2.8.1 is effective if Ker(®) = {e}. 0) 
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In other words, an action is effective if the only element g of G 
for which yg = id is g = e (the identity element of G). As usual, ® 
is effective = Y~g = yp if and only if g = h. 

Then, by Lemma 2.5.2, any action ® of G on X factors 
through Ker(®), and gives an effective action of the quotient group 
G/Ker(®) on X. 


Example 2.8.3. 


(a) If Gis asubgroup of Aut(X) (or G = Aut(X)) then the inclusion 
of G into Aut(X) (or the identity map from G to Aut(X)) gives 
an action of G on X. This may not sound like it gives us any- 
thing new, but we should point out that some of our examples 
of groups arose exactly from this construction. For example, we 
constructed D2, as the automorphism group of a regular n-gon. 
From our point of view here, D2, is a subgroup of Aut({vertices 
of a regular n-gon}). Similarly, although we did not construct 
Z* in this way, we observed that Z* = Aut(Z,,), ie., Z* is a sub- 
group of the automorphism group of the set {[0]n,[I]n,.-.,[n — 
1Jn} of the elements of Z,, consisting of the automorphisms of 
this set that preserve the group structure. (For [i] € Z}, ij : 
Zn —> Ly is the homomorphism yj, ([k]) = [2].) 

(b) We constructed semidirect products in Lemma 2.4.28 precisely 
by using group actions, though we did not use that term there. 
In particular, we constructed the dihedral group D2, in that way 
in Example 2.4.31. © 


Our next family of examples of group actions is a well-known 
theorem. 


Theorem 2.8.4 (Cayley’s theorem). Let G be a group. Then G 
is isomorphic to a subgroup of Sg. In particular, if G is a finite group 
order n, then G is isomorphic to a subgroup of Sy. 


Proof. For any element go of G, let yz, : G + G be the map 
Ygo(9)= gog. We leave it to the reader to check that this is an effective 
group action of G on the set of elements of G. 


We can generalize this example, but first we need a lemma, 
and a definition. 
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Lemma 2.8.5. Let H be a subgroup of G. Then there is a largest 
normal subgroup of G contained in H, i.e., there is a subgroup N of 
HT such that: 


(i) N is a normal subgroup of G; and 
(ii) If N’ is any subgroup of H that is a normal subgroup of G, then 
N' is contained in N. 


Proof. We let N be the product of all of the normal subgroup of 
G contained in H. 


Definition 2.8.6. The subgroup N of Lemma 2.8.5 is the normal 
core of H, written Coreg(H) (or Core(H), when there is no possibil- 
ity of confusion). 0) 


Remark 2.8.7. We observe that if H is a normal subgroup of G, 
then Core(H) = H. Otherwise Core(#) is a proper subgroup of H. © 


Theorem 2.8.8. Let H be a subgroup of G. Then there is an action 
® of G on G/H, the set of left cosets of H in G, given by Yq. (L) = goL 
for every go € G and every left coset L of H in G. The kernel Ker(®) 
of this action is the normal core Core(H) of H. 


Proof. We leave it to the reader to check that ® is a group action. 
We determine Ker(®). We begin with two observations: 


(i) Ker(®) C H: Let go € G, go ¢ H. Then v,,.(H) = gH # H, so 
Ygo is not the identity on G/H. 
(ii) Ker (®) <G : Ker(®) is the kernel of a homomorphism. 


With these observations in hand, we complete the proof by show- 
ing that if N’ is any subgroup of H that is a normal subgroup of G, 
then y,(L) = L for every n € N’ and every L € G/H. 

By choosing a coset representative, we can write L = gH for same 
g € G. Then 


a (gH) =(ng) HT =(¢(g“ng)) 2 =g((o ng) Hf) =sf, 


where the last equality is first of all because g~!ng € N’, as N’ 
is a normal subgroup of G, and second of all because N’ is a sub- 
group of H. 


Here is an application of these ideas. 
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Corollary 2.8.9. Let H be a subgroup of the finite group G. Let 
k=[G: H], and suppose that |G| does not divide k! Then Core(H) # 
{e}. In particular, in this case H contains a nontrivial normal sub- 
group of G. 


Proof. Let N = Core(H), so that N = Ker(®) where © is as in 
Lemma 2.8.9. We know that G/N is isomorphic to Im(®), a sub- 
group of Sg/H. But Sex is isomorphic to S;. Hence |G/N| = 
|Im(®)| divides |S; | = k!. By hypothesis, |G] does not divide k!, 
so we must have |N| > 1. 


Corollary 2.8.10. Let G be a finite group and let p be the smallest 
prime dividing |G|. Then any subgroup H of G of index p is normal. 


Proof. Let N = Core(H). We shall show that N = H. 

By the proof of Corollary 2.8.9, |G/N| divides p! = 1-2---(p—1)p. 
But, since |G/N| divides |G|, and p is the smallest prime dividing 
|G|, this is only possible if |G/N| = 1 or p, i.e., [G:N] = 1 or p. But 
N C H and [G:H] = p, so we must have [G:N] = p = [G:H], in 
which case N = H. 


We now return to considering general group actions. 


Definition 2.8.11. Let ®: G-Aut(X) be an action of the group G 
on the set X. 


(a) For x € X, the orbit Orbit(x) = {y € X | y = y,(x) for some 
Ee Xx}. 
(b) For z € X, the stabilizer Stab(z) = {g € G | yg(x) = x} 


(Note that Orbit(a) is a subset of X while Stab(z) is a subgroup 
of G). 


We observe: 


Lemma 2.8.12. The relation ~ as on X defined by x ~ y if y € 
Orbit(x) is an equivalence relation on X. Consequently, if {Xi}ier is 
the set of equivalence classes of elements of X under ~, {Xj}ier is 
a partition of X. 


Proof. We leave this as an exercise for the reader. 


Definition 2.8.13. If there is only one orbit of X under the action 
of ® of G (or, equivalently, if for any z, y € X there is a g € G with 
g(x) = y), this action is called transitive. ?) 
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Lemma 2.8.14. Let ®: G> Aut(X) be an action of the group G on 
the set X. 


(a) Let x € X. Then there is a 1-1 correspondence 


Orbit (x) < G/Stab (az) 
given by 
y € Orbit(x) © gStab(a) where g € G with y = ~g(2). 


(b) Let x € X. Let K be any subgroup of G that is conjugate to 


Stab(xz). Then K = Stab(y) for some y in the orbit of x in this 
action of G on X. 


Proof. 


(a) Note that the given correspondence is obtained by a choice of 


wna 


element g € G, so our first job is to show that this correspondence 
is well defined, i.e., independent of the choice. 

So suppose we have elements g and h of G with y = Pq(2) 
= p(x). Then x = g71(yn(x)) = Yy-1n() so g th € 
Stab(z) and hence h € gStab(z), and similarly g € hStab(s), 
so g Stab(z) = hStab(z). 

Now we must show this is a 1-1 correspondence. Note that any 
element g’ of g Stab(z) is g’ = ggo for some go € Stab(x) and 
then q(t) = Yogo(2) = Pq (Pgo(2)) = ¥y (x). Thus if yr # 
y2 are two elements of Orbit(x), we cannot have y; € g Stab(z) 
and y2 € g Stab(z) (for any g) so this correspondence is 1-1. And 
also, if we choose y to be the element y = v(x), an element of 
X in the orbit of z, then y corresponds to gStab(z), for any g 
€ G, so this correspondence is onto. 

Suppose that x and y are in the same orbit, so that y = yy, (x) 
for some go € G. We claim Stab(y) = go Stab(x) go7. 

To see this, let g be any element of Stab(z). Then 


Pooggs? (Y) = Pao (% (G51 (v))) = Poo @ (G51 (v))) 
= Pago (Pg (@)) = Pao (2) 


so that goStab(z)g)' © Stab(y). By the same logic, 
9 Stab(y) go C Stab(a ), and so Stab(y) C go Stab(x) gy ', and 
ae these two subgroups of G are equal. 
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On the other hand, suppose that K and Stab(z) are conjugate, 
so that K = go Stab(z) go for some go € G. Let y = Yq (2), 
an element in the orbit of z. Then the above argument shows 
that K = Stab(y). 


Corollary 2.8.15. 


(a) Let x € X. Then #(Orbit(z)) = [G: Stab(x)]. 
(b) Let {2;};e7 be a complete a set of representatives of the equiva- 
lence classes {X;}ier as in Lemma 2.8.12. Then 
#(X) = 5 [G : Stab (2;)]. 
ier 

Proof. (a) is immediate from Lemma 2.8.14(a). As for (b), note 
that, since {X;};e7 is a partition of X, where X; = Orbit(z;), #(X) = 
ier # (Xi), so this follows directly from (a). 

Now let us apply these general considerations to a particular sit- 
uation. First, a definition. 


Definition 2.8.16. The center Z(G) of a group G is the set of 
elements of G that commute with every element of G, 


Z(G) ={z€G|zg= gz foreveryg eG}. ©) 
Remark 2.8.17. Z(G) is an abelian normal subgroup of Gj, as is 
any subgroup of Z(G). % 


We now consider another action of the group G on the set of 
elements of G, this time not by left multiplication (as in Cayley’s 
theorem) but rather by conjugation. 

That is, we let 6: G + Aut({elements of G}) by yg(h) = ghg7!. 
We leave it to the reader to check that this is a group action. 
Then two elements of G are in the same orbit if and only if they 
are conjugate (Definition 2.3.31). We call the orbits of G under 
conjunction the conjugacy classes of G. For x € G, Stab(x) = 
{g€G | grg-1 = x} = {gEG | gx = zg} consists of the elements 
of G that commute with 2; we call this the centralizer of x and 
denote it by C(z). 


Corollary 2.8.18 (The class equation for G). Let G be a finite 
group. Then 


IG] =|Z(@|+ $0 1G: C (a) 


iel’ 
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where the sum is taken over a complete set of representatives of the 
conjugacy classes of elements of G not in the center of G. 


Proof. Let {X;}ie7 be the conjugacy classes of G. Then we know 
that these partition G, and so 


IG] = DU # (Xi) = DIG: C (a)] 
iel iel 
by Corollary 2.8.15. 
Now if z € Z(G) then C(a;) = G (and conversely), so in this 


case |G: C(a;)] = 1, and also in this case the orbit of x consists of x 
alone (and conversely). Thus we may rewrite this sum as 


IG)= S> 14+ 5° 1G: C(x) =|Z(@|+ Y= [G: C(av)} 


LEZ (G) eel’ acl’ 


as claimed. 


Remark 2.8.18. We have been very careful here. Given an action 
®: G > Aut(X), we have let yy = ®(g) so for each g € G, and each 
xz € X we have the result y,(z) of the action of the element g on z. 
When ® is understood, y,(x) is often (indeed usually) abbreviated 
to g(z). > 


2.9 Structural results on Nonabelian groups 


As opposed to the case of finite abelian groups, where we fully 
described their structure in Section 2.6, there is no way to fully 
describe the structure of finite nonabelian groups. But there are 
important things we can say, and we say some of the most important 
ones in this section. (Actually, our results here are true whether or not 
the group is abelian, but in the abelian case they tell us nothing new.) 


Theorem 2.9.1 (Cauchy’s theorem). Let G be a finite group of 
order n, and let p be a prime dividing n. Then G has an element of 
order p. 


Proof. (McKay) Let G? = G x--- x G, where there are p factors, 
and let 


X= {et € G 


fta-¢ 


i=1 
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Note that #(X) = |G|P-! as we may choose g1,...,gp)—1 arbitrary 
p 


and then g, = (91 ... 9p-1)71- 
We now define an action of the group Z, on X by “rotation”: 
If Zp = {[0] ,[1],-.-,[p — 1]} then 


Pil] (91, 92, tee Op) = (Gps 91> 925 tee Gp) 


and yy) = gh. Note this makes sense as yp) = a = id. Clearly 
this is an action of Z, on G? but it is also an action of Z, on X, since 
if (91,---,9p) © X, 91)(91,---.9p) € X as we see from the following 


argument: Suppose gj ... 9) = e. Then 
CpG Gp-a= (Ong a.itp-a) GOs Gp Gi a, 
= eo, =Gp9,° =e 
Let {Xi }ie7 be the distinct orbits of X under this action of G, and 


let {2;};e7 be a complete set of coset representations of the orbits. 
Then by Corollary 2.8.15, 


|X| = $5 [Zp: Stab(ai)] 
wel 

Now Stab(2;) is a subgroup of Zp, and p is prime, so is either {[0]} 
or Zp itself. Let ny be the number of x; for which this index is 1 (i.e., 
for which Stab(z;) = Z,) and n, be the number of 2; for which this 
index is p (i.e., for which Stab(2;) = {[0]}). Then 

|X| =nm1-1+np-p 

Now |X| = |G|?~* is divisible by p, so nj is divisible by p. Let 
Z = (91,---,9p) be an element of X with Stab(x) = Z,. Then 
pry(e) = g, Le., 


(91; 92, re) Gp) _ (9p; 91, 92; cee Opt) 
LG Ot = Yor 0G = Giysis5 Oe = You1; Which implies 91 = 99 = >> = 
Jp—1 = Y- Call this common value g. Then x = (g,...,g) and, since 


x © X, g? = e. Conversely, any element « of X of this form has 
Stab(z) = Zp. Thus we see that if S = {g ¢ G| g? = e}, then nj 
= #(5). 

As we have just seen, nj, is divisible by p. But we cannot have 
n, = 0, as e€S. Thus n, > p. Choosing go to be any element of S 
other than go = e, we see that go is an element of G of order dividing 
p, and hence, since p is a prime, of order (exactly) p. 


Now we turn to the study of groups of prime power order. 
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Definition 2.9.2. A group G is a p-group if the order of G is p”, 
for some prime p and some positive integer n. % 


Lemma 2.9.3. Let G be a p-group. Then the center Z(G) of G 
is nontrivial. 


Proof. If G is abelian, then Z(G) = G is certainly nontrivial. 
Suppose not. Consider the equation class equation for G (Corollary 
6.16): 


IG] =|Z(@)|+ So 1G: € (a). 
wel! 

Now |G| is divisible by p. In the summation, each C(2;) is a proper 
subgroup of G, so its index [G: C'(a;)] is greater than 1, and divides 
|G|, so is also divisible by p. Thus every term on the right-hand side, 
except possibly |Z(G)|, is divisible by p, so |Z(G)| is divisible by p 
as well; in particular, Z(G) is nontrivial. 


If G has order p, then we know that G is cyclic, so in particular, 
it is abelian. What if G has order p?? 


Corollary 2.9.4. Let G be a group of order p*. Then G is isomorphic 
to Zy2 0rZy X Zy. In particular, G is abelian. 


Proof. Since a priori G may not be abelian, we will write G 
multiplicatively. 

Let |G| = p?. We ask whether G has an element a of order p?. If 
so, G = {e, a, a”,... apt } is cyclic of order p?, so is isomorphic 
to Z,2. 

Suppose not. By Lemma 2.9.3, Z(G) is nontrivial. Let a € Z(G), 
a # e. Then a has order p, so a generates a subgroup H; = {e, 
a, ..., a} of G of order p. Now let 8 € G, 8 ¢ Hy. Then B 
generates a subgroup H» = {e, 8, ..., 8?~'} of G of order p. Then 
Hy Hz = {e}, as Hy M Hz is a proper subgroup of H, (and of 
Hy). Since a is in the center of G, a commutes with every element 
of G; in particular a commutes with 6, and consequently every ele- 
ment of H, commutes with every element of H2. This readily implies 
that H,Hp2 is a subgroup of G. (Alternatively, since H, is a normal 
subgroup of G, H;H2 is a subgroup of G by Lemma 2.4.18.) 

Also, |H1H2| = p? = |G| by Lemma 2.4.17, so G = Hi H2. But 
again, since every element of H; commutes with every element of 
H», G is a product H, x H2 so is isomorphic to Zp x Zp. 


112 An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


(For use in our next result, let us make the isomorphism explicit. 
Let Zp = Ce ere alia and also Z, = 1B Upictgb? Then 
we have an isomorphism y: Z, x Zp — G given by y((a,e)) = 
a, y((e, b)) = B, and in general y ((a’, b’) = a’ 87). 


What about groups of order p?? We show that there is a non- 
abelian group of order p® by explicitly constructing it as a semidi- 
rect product of a subgroup isomorphic to Z, and a normal subgroup 
isomorphic to Zp X Zp. 


Example 2.9.5. Let No = Z, x Zp», which we write multiplicatively, 
No = {(a’, b’)|0 <i < p—1,0 <j < p—1}, where a? = e and P = 
e, as in the proof of Corollary 2.9.4. Then No has an automorphism 
y: No - No given by y((a, e)) = (a, 6) and y((e, b)) = (e,); 
more generally y((a’, b’)) = (a’, b’ +7). It is straightforward to check 
that y is indeed an automorphism of No, and moreover that vy? = 
id: No—No. Thus, letting Ho be another copy of Z,, which we also 
write multiplicatively, Ho = {e,c,...,@~} with @ = e, we have 
a homomorphism ®: Hyp—Aut(No) defined by ®(c) = y and, more 
generally, ®(c*) = y*. Then the semidirect product Ho x No defined 
as in Lemma 2.4.28 is a nonabelian group of order p’. 

Referring to the calculations in Corollary 2.4.29, we see that if G 
is the group G = {y*a' BI |0 << k <p—-1, 0<i<p-1,0<j<p-l} 
with multiplication given by 


y=e, aP=e, BP =e, a8 =fBa,yBy' =B6, yay =a8 


then we have an isomorphism ~w: Ho x No —> G, given by 


W (c, (e,e)) = wle, (a, e)) =a,  (e, (e, b)) = B, 
and in general w (e, (ar b’)) = ykKo' pd, 0) 
Here is a general result about p-groups. 


Lemma 2.9.6. Let G be group of order p*, k > 1. Then there are 
normal subgroups {e} = No C Ni C -:: C Ne = G of G with 
|Nj| =p', t=Ojstack 


Proof. We prove this by induction on k. 
In case k = 1, we certainly have {e} = No CM =G. 
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Now suppose the lemma is true for all groups of order p*~! and let 
G have order p*. By Lemma 2.9.3, Z(G) is nontrivial. Then Z(G) is a 
p-group, so in particular its order is divisible by p. Then by Cauchy’s 
theorem (for abelian groups, the easy case of this theorem), Theorem 
2.6.1, Z(G) has an element of order p. Let H be the cyclic subgroup 
generated by this element, let Q = G/H be the quotient group, and 
let 7: G > Q be the quotient map. Then Q is a group of order p*~!. 
By the inductive hypothesis Q has normal subgroups Mp C My --- C 
Mg_1. Let No = {e} and fori=1,...,k, let N; = 1~'(Mj_1). Then 
No C Ny Cc -+: C Nyx is as claimed. 


By Theorem 2.6.4, a finite abelian group G is the direct product of 
its subgroups of prime power order. There is no corresponding result 
for finite groups in general, but, nevertheless, the prime-power-order 
subgroups of a general group G play an important role in deter- 
mining, and analyzing, its structure. We now prove the fundamental 
theorem about these subgroups. 


Definition 2.9.7. Let G be a group of order n = p*m where k > 1 
and m is relatively prime to p. 


(a) A subgroup H of G is a p-subgroup if H has order p/ for some j 
with 1<j<k 
(b) A subgroup H of G is a p-Sylow subgroup if H has order p*. © 


Theorem 2.9.8 (Sylow). Let G be a group of order n = p*m, 
where k > 1 and m is relatively prime to p. 


(a) Ghas a p-Sylow subgroup, and every p-subgroup of G is contained 
in some p-Sylow subgroup. 

(b) The p-Sylow subgroups of G are all mutually conjugate. 

(c) The number of p-Sylow subgroups of G is congruent to 1 modulo 
p and divides the order of G (or, equivalently, divides m). 


Proof. We first show that G has a p-Sylow subgroup. 

We prove this by complete induction on n = |G|. 

If n = p* then G itself is a p-Sylow subgroup. Suppose that n > 
p*. Consider the class equation for G (Corollary 2.8.18), 


|G] = |Z (@|+ $2 [G: C(a)]. 


ael’ 
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If G is abelian then G = Z(G) and G has a p-Sylow subgroup by 
Theorem 2.6.14. Suppose not. 
There are two possibilities: 


(1) For some 2;, [G: C (2;)] = 7 is not divisible by p. Then |C (x;)| = 
p*m! with m’ = m/j < m. Then by the inductive hypothesis 
C(a;) has a p-Sylow subgroup H, i.e., a subgroup of order p*, 
and then H is a p-Sylow subgroup of G. 

(2) [G: C (a;)] is divisible by p for every 7. Then, since | G| is divisible 
by p, |Z(G)| is divisible by p as well. Now we argue as in the 
proof of Lemma 2.9.6. Z(G) has an element of order p, hence a 
(cyclic) subgroup of H of order p, which is a normal subgroup of 
G. Let Q = G/H and let 7: GQ be the quotient map. Then 
Q is a group of order p*~!m < n, so by the inductive hypothesis 
Q has p-Sylow subgroup, i.e., a subgroup K of order p*~!. Then 
H = x~'(K) is a subgroup of G of order p*, ie., a p-Sylow 
subgroup of G. 


Thus, by induction, every finite group has a p-Sylow subgroup. 
Now we establish the more precise claims of the theorem. 


(a) Let P be a p-Sylow subgroup of G. Let 
X = {subgroups of G conjugate to P}. 


Since every element of X is a subgroup of G conjugate to P, and 
conjugate subgroups have the same order, every element of X is 
a p-Sylow subgroup of G. (Part (b) of the theorem tells us that 
in fact X consists of all of the p-Sylow subgroups of G, but we 
don’t know that yet.) 

Now G acts on X by conjugation, and by the definition of X, 
there is only one orbit, i.e., this action is transitive. Then by 
Lemma 2.8.14, #(X) = [G: Stab(P)]. But we certainly have 
that P C Stab(P), so [G: Stab(P)] divides [G: P] = |G|/ 
|P| = m. In particular, #(X) divides |G| and is relatively 
prime to p. 

Now let H be any p-subgroup of G. We show H C P?’ for 
some P’ € X. To that end, consider the action of H on X by 
conjugation. Then, if {z;} is a complete set of representatives of 
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orbits of H on X, we have by Lemma 2.8.15, 


#(X) =) 0 [H: Stab(z;)]. 


wel 


We have established that #(X) is not divisible by p, so some 

term on the right hand side is not divisible by p. But [H: Stab(2;)] 
divides | H|, which is a power of p, so that term must be equal to 1. 
Thus there is some p-Sylow subgroup P’ of G with hP’h~! = P’ 
for every h € H, i.e., with HP’ = P'H. Then by Lemma 2.4.18, 
HP’ is a subgroup of G, and then by Lemma 2.4.17, |HP’| = 
A\|P'|/|HOP"| = |P'| (|| /|H9P’|) = p* |H: HOP’). Now 
on the one hand HP’ is a subgroup of G, so |HP’| must divide 
G| = p*m. On the other hand [H: HM P’| must divide |H], 
and, since H is a p-group, [H: HM P’] must be a power of p. 
But p* is the highest power of p dividing, |G|, so we must have 
A: ANP|=1,ie, HNP’ =H, i-e., H C P’, establishing (a). 
Note that the p-Sylow subgroup P’ is an element of X, ie., is a 
p-Sylow subgroup conjugate to P. So let H = P” be any p-Sylow 
subgroup of G. Then P” C P’ by (a). But |P”| = |P’| so P” = 
P’. Thus P” € X,ie., P” is conjugate to P. 
From (b), we know that the set X consists of all of the p-Sylow 
subgroups of G. We have already observed that #(X) divides 
|G'|. Now consider the action of P on X by conjugation. Again 
we have 


#(X) = $0 [P: Stab(2;)] 


jeJ 


where {z;} is a complete set of representatives of the orbits of P on 
X. Consider «€X. There are two possibilities: 


(1) 
(2) 


z = P. Then P is stabilized by all P, ie., Stab(P) = P, so [P: 
Stab(P)) = 1. 

z = P' # P. Let H = Stab(P’), a subgroup of P. The argument 
here parallels the argument for (a). HP’ = P’H, so HP’ is a 
subgroup of G, and then, as in (a), H C P’. But H C P, so 
H C P'P. Now |P| = |P’| and P’ 4 P so P'M P is a proper 
subgroup of P. Thus H is a proper subgroup of P, so in particular 
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we have [P: H] = [P: Stab(P’)| > 1, and since P is a p-group, 
we must have [P: Stab (P’)] divisible by p. Thus in the sum on 
the right-hand side, we have one term equal to 1 and every other 
term divisible by p, so the total, which is #(X), is congruent to 
1 (mod p). 


Theorem 2.9.8 had three parts. These are often called the three 
Sylow theorems. 


Corollary 2.9.9. Let G be finite group and let p be a prime dividing 
the order of G. Then G has a unique p-Sylow subgroup if and only if 
that subgroup is normal. 


Proof. Immediate from Theorem 2.9.8(b). 


Lemma 2.9.10. Let p and q be distinct primes with p < q. Let G be 
group of order pq. Then the q-Sylow subgroup S, of G is normal. 


Proof. The number m of g-Sylow subgroups must be congruent to 
1 (mod q), so m is either 1 or greater than gq. It must also divide |G], 
so m must be 1, p, or g. But m cannot equal g, and m cannot equal 
p as we are assuming p < q. Hence, m = 1. 


Corollary 2.9.11. 


(a) Let p and q be distinct primes with p < q. Ifq#1(mod p) then 
any group of order pq is isomorphic to the cyclic group Zpgq. In 
particular, any group of order pq is abelian. 

(b) Let p and q be distinct primes with p 4 +1(mod q) and q # 
1(mod p). Then any group of order p*q is abelian. 

(c) Let p and q be distinct primes with p # +1(mod q) and q # 

+1 (mod p). Then any group of order p*¢? is abelian. 


Proof. First note that in case (a), since p < q we have p # 
1(mod q). 

Let S, be a p-Sylow group of G and Sq be a g-Sylow subgroup 
of G. By Corollary 2.3.21, or by Corollary 2.9.4, S, and Sg are both 
abelian. By the Sylow theorems, the number of p-Sylow subgroups 
is not divisible by p and must divide |G|, so must be a power of ¢. 
Similarly the number of g-Sylow subgroups must be a power of p. 
But also the number of p-Sylow subgroups must be congruent to 1 
(mod p) and the number of g-Sylow subgroups must be congruent to 
1 (mod q). Under the given conditions p and q, the number of each 
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these subgroups must be 1, i.e., they must each be normal (Corollary 
2.9.9). But observe that |G| = |5S,||S,| and S, M Sy = {e} 
(as S, M Sq, is a subgroup of 5, and of Sz, so its order must divide 
both p and qg and hence must be 1). Then, by Lemma 2.4.17, G = 
SpSq and then, referring to Definition 2.4.21, we see that G = S, x Sy 
is the direct product of 5, and Sj. Finally, in case (a) we conclude 
from Theorem 2.2.15 that G is cyclic. 


In the cases where p and q do not satisfy the conditions of Corol- 
lary 2.9.11, we will construct examples where G is not abelian. 


Example 2.9.12. 


(a) Suppose that gq = 1 (mod p), so that p divides gq — 1. We know 
that Aut(Z,) is isomorphic to Zj, of order q—1, so by Cauchy’s 
theorem (for abelian groups), Z; has an element of order p, i-e., 
there is an automorphism y: Z, — Z, of order p. Then if ®: Z, + 
Aut (Z,) is defined by © ([k]) = ¢*, from Lemma 2.4.28 we obtain 
a nonabelian group of order pq as the semidirect product of a 
subgroup isomorphic to Z, and a normal subgroup isomorphic 
to Zg. 

(b) If p =1 (mod qg) or g = 1 (mod p), then (a) gives us a nonabelian 
group of order pq, and talking the direct product of this group 
with Z, gives us a nonabelian group of order pq. 

This leaves the (harder and more interesting) case p = —1 (mod 
q). Note in this case p? = —1 (mod q). We claim that 
Aut(Zp X Zp) is a group whose order is divisible by g. Then we 
proceed as in part (a): There is an element y of Aut(Z, x Zp) of 
order q, and if ®: Z, + Aut (Zp x Z,) is defined by © ([k]) = y*, 
then again from Lemma 2.4.28 we obtain a nonabelian group of 
order p”q as the semidirect product of a subgroup isomorphic to 
Zq and a normal subgroup isomorphic to Z, x Zp. 

That it remains to prove our claim, and to do so we must care- 
fully investigate the group Aut(Z, x Z,). 

As in the proof of Corollary 2.9.4 we write 


Zp X Zp = {(a',) |O<i<p-1,0<j<p-1}. 


Recall we have the automorphism of Z, given by taking any 
element of Z, to its k-th power, for any k relatively prime to 
p. Clearly we may perform this automorphism on both factors, 
and we may choose the powers independently. Thus we have an 
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automorphism given by mn: Zp X Zp — Zp x Zp given by 
bmn ((255)) = (a,b) and hence Bnn((aé, b1)) = (a, bi") for 
any integers m and n, both relatively prime to p. We also have the 
automorphism we used in Example 2.9.5, which here we denote 
by a: Zp X Zp — Zy x Zp, given by a((a,e)) = (a,b), o((e,b)) = 
(e, b) and hence o((a’, b’)) = (a', b'+7). Finally, we have an auto- 
morphism T: Zp xX Zp + Zp X Zp given by 7((a,e)) = (e,6), 
1((e,b)) = (a,e) and hence 7{((a’,b’)) = (a’,b"). With these in 
hand we go to work. 

Any automorphism of Z, x Zp must take the identity element (e, 
e) to itself, so we have an orbit {(e, e)} consisting of the identity 
element alone in the action of Aut(Z, x Zp) on Zp x Zp. Consider 
any element of Z, x Zp other then the identity. We distinguish 
three cases: 


(i) An element (a™, e). Then (a, e) = %m,1((a, e)). 

(ii) An element (e, 6”). Then (e, b”) = T(%n1((a, e€))). 

(iii) An element (a, 6”) with both m and n not divisible by p. 
Then (a, 6") = Umm(o((@, €))). 

Thus we see that all elements of Z, x Z, other than the identity 

are in the orbit of the element (a, e) of Zp x Zp in the action of 

Aut(Zp x Zp) on Zp X Zp. In other words, Z, x Zp — {(e,e)} is 

a single orbit in this action. (Otherwise said, Aut(Z, x Zp) acts 

transitively on Z, x Z, — {(e,e)}). Now Zp x Z, — {(e,e)} has 

p* — 1 elements, so by Lemma 2.8.14, 


p’ —1=# (Zp x Zp — {(e, €)}) = [Aut (Zp x Zp) : Stab (a, e))| 


so in particular p* — 1 divides |Aut(Z, x Z,)|, as claimed. 

In this case we have a nonabelian group of order p?q, or of order 
pq” (or both), by part (b), and then taking the direct product of 
this group with Z,, or with Z,, we obtain a nonabelian group of 
order p?q?. % 


Remark 2.9.13. In Example 2.9.12(b), the p-Sylow subgroup of G 
was normal and hence the group G, of order p?q was the semidirect 
product of its p-Sylow subgroup and its normal p-Sylow subgroup. 
This is no surprise. Suppose that p = —1 (mod q). Then, except in 
the case p = 2, q = 3, p > g. Thus q is the smallest prime dividing |G], 
and its p-Sylow subgroup, of order p?, has index gq, so this subgroup 
must be normal by Corollary 2.8.10. 
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In case p = 2, q = 3 this construction still works, so the p-Sylow 
subgroup may still be normal, but need not be: take G = Dg x Zo. 0 


One of the basic theorems of group theory is Lagrange’s theorem 
(Corollary 2.3.18): The order of the subgroup of a finite group must 
divide the order of the group. In other words, if G has order n and 
H is a subgroup of G of order d, then d divides n. It does not say 
that if d is any positive integer dividing n, then G has subgroup of 
order d. In fact, that is false, and the group constructed in Example 
2.9.12(b) provides a counter example. 


Example 2.9.14. Let p = —1 (mod q) and let G be the group of 
order p?q constructed in Example 2.9.12(b). We shall show that G 
does not have a subgroup of order pg. Let 5S, be a p-Sylow subgroup of 
G and S, be a q-Sylow subgroup of G. Then S;, is a normal subgroup 
of G isomorphic to Zp, x Zp and Sy is a group of order of q so is 
isomorphic to Z,. Let X = {cyclic subgroups of 5, other than {e}}. 
Note that S, has p? — 1 elements of order p, a cyclic subgroup has 
p—1 elements of order p, and two distinct cyclic subgroups of 5, only 
intersect in the identity element. Thus # (X) = (p? — 1) /(p—1) = 
p+. Also, since S, is a normal subgroup of G, it is the unique 
p-Sylow subgroup of G (Corollary 2.9.9), so by the Sylow theorem, 
Theorem 2.9.8(a), X is in fact the set of all subgroups of G of order 
p. Since conjugate subgroups have the same order, we see that Sy 
acts on X by conjugation. Then, if {X;};e7 is the partition of X into 
the orbits of Sj, and 2; is a representative of the orbit X;, we have 


p+1=#(X) = >) #(X%) = D5 [Sz : Stab (2:)]. 


iel iel 


Now every orbit has size dividing |.5,| = q, i-e., has size 1 or g. Note 
that q divides p + 1 (as we began with p = —1 (mod q)) so if there 
is an orbit of size 1, there must be at least q of them; in particular 
there must be at least 2 of them. Suppose this is the case, and let 
H, and H»2 be two subgroups of S, of order p with SqH1S,' = Ay 
and SqH2S7' = Ho. Then S,H; = HS, and S,H2 = H2Sq so by 
Lemma 2.4.18 both of these are subgroups of G, of order pq, and then 
by Corollary 2.9.11(a) these subgroups are abelian. (Note that the 
roles of p and g here are reversed compared to those in that Corollary: 
Here q is the smaller prime and p = —1 (mod q) sop # 1 (modq).) 
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Thus every element of Sy commutes with every element of H1, 
and with every element of H2, so with at least 2p — 1 elements of S,. 
But it is easy to check that K = {g € S,|gh = hg for every h € S,} 
is a subgroup of S,. Since any subgroup of must have order 1, p, 
or p’, we conclude that in fact K = Sp, Le., that every element 
of S, commutes with every element of S,, or that conjugation by 
any element of S, is the identity automorphism of S,. But this is 
impossible, as we constructed G as a semidirect product beginning 
with an automorphism y of 5, of order q. (Otherwise said, any two 
elements of S, commute with each other, and if every element of S, 
commuted with every element of S,, G would be abelian, and it is 
not.) Thus we see that this is impossible, and hence that every orbit 
X; must have qg elements, and so Stab(H) = {e} for every HEX. Now 
the subgroup Sy may not be unique, but note this argument holds 
for any g-Sylow subgroup of G. (In fact, Sy cannot be unique, as if it 
were, it would be normal, and then G = S, x Sg would be abelian, 
which it is not.) 

Now suppose G has a subgroup F of order pq. Then certainly F = 
HS, for some subgroup H, cyclic of order p, of G, and some subgroup 
Sq, cyclic of order g, of G. Let r be a generator of Sy. Then, on the 
one hand, rHr~! = H as F is abelian, while on the other hand, 
rHr-' = H # H asr ¢ Stab(H) ={e}; contradiction. 0) 


To conclude this section we introduce, and study the basic proper- 
ties, of a class of group known as solvable groups. The reason for this 
name is that they originally arose out of the question of being able 
to solve polynomial equations, and that is our interest in them here. 
We will be considering the question of solvability of polynomial equa- 
tions in Chapter 4, on field theory, and we introduce solvable groups 
in this chapter on group theory in order to have them available when 
we need them. 

It turns out that solvable groups are important in themselves, 
through we will not be studying them for their own sake. This once 
again illustrates the unity of mathematics. 


Definition 2.9.15. Let G be a group. 


(a) A subnormal series is a series of subgroups of G 


G=GoD.G,>D :-: IG, =i} 
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with G; < Gj_1 (ie., with G; a normal subgroup of Gj_1) for 
each i > 1. 

(b) A composition series is a subnormal series with G; a maximal 
proper normal subgroup of G;_ (i.e., there is no proper normal 
subgroup H of G;_1 with G; C H C Gj_1) for each i > i. © 


Definition 2.9.16. A group G is solvable if it has a subnormal series 
with the quotient groups G;_1/G; abelian for each i > 1. © 


Example 2.9.17. 


(a) Any abelian group is solvable as the abelian group G has the 
subnormal series G D {1} with G/{1} abelian. 

(b) Let G be a nonabelian group of order pq, with p and q distinct 
primes. Let p < q. Then, by Lemma 2.9.10, G has a normal 
subgroup G; isomorphic to Zj. Then, we have the subnormal 
series GD G, D {1} with G/G isomorphic to Zp and G/{1} 
isomorphic to Zg, both of which are abelian. 

(c) Let G bea p-group. Then Lemma 2.9.6 gives a composition series 
for G, so G is solvable. © 


Lemma 2.9.18. A finite group G is solvable if and only if it has a 
composition series with G;_1/G; cyclic of prime order for alli > 1. 


Proof. Certainly if G has such a composition series it is solvable. 

Conversely, suppose that G is solvable and consider a subnormal 
series G= Gp D G1 D--: D Gy = {1} with each G;/G,_, abelian. 
Suppose that for some i, H = G;/Gj_1 is not cyclic of prime order. 
Let 7: G; > H be the quotient map. 

Let p be a prime dividing |H|. Then by Cauchy’s theorem (for 
abelian groups — the easy case) H has an element of order p. Let K 
be the subgroup generated by this element, so |K| = p. Since H is 
abelian, K < H. Let Gj_1 =~! (K). Then we have 


G2 Gi > G1 


with GG, isomorphic to K and GIG 4 isomorphic to H/K, a 
group of smaller order than H. If H/K is of prime order, there is 
nothing more to do. If not, repeat this process. It eventually stops 
as we are decreasing the order of the group at every stage. 
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Then “refine” the original subnormal series by inserting all 
of these intermediate groups to obtain a composition series for 
G as claimed. 


Lemma 2.9.19. Let G and H be solvable groups. Then G x 
H is solvable. 


Proof. Let G= Go DGD -:: D Gy = {1} be a subnormal 
series for G and H = Hj D Hi D -:: D H, = {1} be a subnormal 
series for H. Then 


Gx HDG, x AD = Sf} x Si} xk Aho < Sfx {1} 
is a composition series for G x H. 


Theorem 2.9.20. 


(a) Let G be a solvable group and let H be a subgroup of G. Then H 
is solvable. 

(b) Let G be a solvable group and let Q = G/N be a quotient of G. 
Then Q is solvable. 

(c) Let G be a group and let N be a normal subgroup of G. If N and 
Q = G/N are solvable, then G is solvable. 


Proof. 
(a) Consider a subnormal series G = Go D G1 D --: D Gy = {1}. 
Let A; =A 1 GG. Then A = By 2 Ay > «+ SD Ap = {1}, 


We may eliminate the terms in which H; = H;_, to obtain a 
subnormal series. But then 


Aya A; = (19 Gi_-1)/(H M Gi) = (H M Gi_-1)/(2 M Gi-1) NG; 

= (fH GeGy Gs G6 Gif G; 

is isomorphic to a subgroup of an abelian group and 
hence is abelian. 

(b) Consider a subnormal series G= Gp D Gj D --: D Gy = {1}. 

Let 7: GQ be the quotient map and let Q; = 7(G;). Then 


Q=Q02Q12 -:: DQ, = {1}. We may eliminate the terms, 
in which Q; = Q;-1 to obtain a subnormal series. But then 


Q,;=G,N/N = G;/G;NN, so that 
Qi-1/Qi = (Gi-1N/N) /(GiN/N) 
= (Gy-1/Gu-1 A N)/(Gi/G; ON) 
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is isomorphic to a quotient of the abelian group G;_1/G; and 
hence is abelian. 

(c) Let 7: GQ be the quotient map. Consider subnormal series 
= O95 D Qe ={1) and N = Ng a D> A, =H 41h, 
Then we have the subnormal series for G. 


G=n 4(Qo) Dw '(Qi1)>D «++ 17" (Qz) 
=N > Ny Set Np = {1} 


and so G is solvable. 


Remark 2.9.21. At this point you may wonder if there are any 
groups that are not solvable. There are. We will see in the next 
section that the symmetric groups S, and the alternating groups 
An, both for n > 5, are not solvable. In fact, it is known that As, of 
order 60, is the nonsolvable group of smallest order. © 


2.10 The symmetric groups 


In this section, we will be considering the symmetric groups S,, n a 
positive integer. These are very specific groups and we will be proving 
most of our results by doing specific computations. 

Recall that 


Sq = Aut({1,...,0}) = {byections @: 1 1,..,,7} + {1,...,7}}). 


We will abbreviate {1,...,n} to N,, and we will call the action of 
S;, on N, given by o € S,, takes k € N,, to o(k) the canonical action. 


Lemma 2.10.1. The canonical action of S, on Ny is transitive. 


Proof. If n= 1 this is trivial. Suppose n > 1. Then for any 7 with 
2<j<nwe have o € S, given by 


ali = 9,¢(7)=1,¢0)=t fort =1,...9%, 4 F 17 Fy. 


Thus the orbit of 1 is N,, and the action is transitive. 


We have observed that |S,,| =n! We now give a proof of this from 
the viewpoint of the canonical action. 
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Lemma 2.10.2. For any positive integer n, S, is a group of order n! 


Proof. By induction on n. 

If n = 1, there is only a single bijection a: Ny — Ny, namely the 
bijection given by o(1) = 1, so |.$1| = 1. 

Assume true for n — 1 and consider the canonical action of S,, on 
N,. By Lemma 2.8.14, we know that 


# (Orbit(n)) = [S, : Stab(n)]. 


By Lemma 2.10.1, Orbit(n) = Nn, so #(Orbit(n)) = n. Now 
Stab(n) = {o € S,|o(n) = n} and we have an isomorphism 
yp: Sn—1 > Stab(n) given as follows: 

Let o9 € Sy_1, so that a9 : Ny-1 — Ny_y is a bijection. Then 
o = (0) is the bijection o: N, — Ny, given by o(t) = oo(t) for 
1<i<n—1 and o(m) =n. 

Then by the induction hypothesis |S,_1| = (n — 1)! and then 
|S,| = n(n — 1)! = n!, and by induction we are done. 


Remark 2.10.3. A bijection from a set X to itself is often called a 
permutation of X. Thus in this language 5S, is the group of permu- 
tations of N,. © 


We now introduce two notations for permutations. The first one 
is logically simpler, and is unambiguous. The second is more com- 
plicated, but is also more illuminating, and is the notation most 
commonly used. 

The first is the “two-line” notation. An element o of S,, is a func- 
tion 0: N > N,,, and the two-line notation for o is essentially a table 
of values: 


For example 
_ 12 3 4 5 
OMe: Bt bd 
is the function o: Ns; > Ns given by o1(1) = 2, 01(2) = 3, 01(3) = 1, 
o1(4) = D, o1(5) = 4. 
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This notation makes it easy to compose functions. (Remember 
that composition is the group operation in S,.) Note that co’(i) = 
a(o’(i)) so that we apply the permutation on the right first. 

For example, if o, is the permutation above and go is the 


permutation 
_ 12 3 4 5 
eSNG oe Ge ay? 
then o109 is 
_ ill <2) 
0102 => 9 3 
2 


fil 

a 5 
as 0109(1) = 01(02(1)) = o1(2) = 3, 0102(2) = 01(09(2)) = o1(4) = 
5, etc. We may perform this computation simply by following the 


arrows in the following diagram, where we just indicate how to com- 
pute 0102(1): 


ew Re ow 
Be ote 
bo ot 
ae 


Composition of functions is not in general commutative, and so 
there is no reason to expect that oj02 = o20;, and in fact in this 
example it is not: 


1 2 
0201 = 2 4 
2 


fil 
N\A 8 


The identity element of S, is just the identity function 0: N, > 
N,, given by o(i) = 7, for each 7, so, for example, the identity element 


of Ss is 
12 3 4 5 
123 4 57° 
Also, in this notation it is easy to find inverses. If 7: N, > Ny is 
a bijection, then o~!(a(i)) = i for each i, ie., if the value of o on 7 


NW Www 
ee of 
re oO 
NY 
ao 
Ne 
w rw 
Re ww 
oe 
Hy Ot 
NY 
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is o(i), then the value of o~! on a(i) is i. So, looking at the two-line 
notation, o~! should be given by 


o(1) o(2) a(n) 


except that the top lines is out of order, so rearranging it gives, 


= 1 2 n 
~ Not) «7 }(2) o-}(n) 


For example, if 01 is as above then 


4/2315 4) 12345 
i Co a ea eae 


and if o2 is as above then 
4 243 5 1 12 3 4 5 
0, = a : 
Ll 2 3A oi 2 A 
We now shift gears and develop the notion of a cycle. On the 
one hand, the “cycle structure” of a permutation gives us essential 
information about it, and on the other hand, this will lead us to 
our second notation for a permutation, the “cycle notation”. In fact, 


the reason this notation is preferred is because it displays the cycle 
structure. 


Definition 2.10.4. A permutation o € Si), is an r-cycle if there is a 
subset C = {71,...,%7,} of N, of cardinality r with 
o(i1) = i2, 0(t2) = 03,-.-. Ota 4) = ep Op) = ay 
and o(i)=i fori€C. 
We denote this r-cycle by (i; 72 ...%,). We say that (71,...,%,) is 


the cyclic order on C' given by o and that in this order 71 precedes 
19,...,%p—1 precedes 7,, and 2, precedes 71. © 


Remark 2.10.5. 


(a) Note that any 1-cycle is the identity. 
(b) Note that this notation is not well-defined, as 
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(ivi... ip—pip) = (igig.... ipiy) = (iia... iyég) = + 
= (ipiy ...ip—pip—1) 


and all of these give the same cyclic order on C’. © 


Remark 2.10.6. Observe that an r-cycle is an element of S, of 


order r. © 
Definition 2.10.7. Two cycles 0, = (i1,...,i-) and og = 
(915 +++5 93) are disjoimt if 127.045 %} 4 9i)4+.4 9s) =O. © 


Lemma 2.10.8. Let o, = (i1,...,%,) and og = (j1,...,js) be dis- 
joint cycles, and let 03 = 0201. Then a3 is given by 


o3(k) =oi(k) ifk € {t1,..., tr}, o3(k) =o0(k) ifk € {fi,---, Js}, 


o3(k) = o1(k) = o9(k) = k otherwise. In particular, disjoint cycles 
commute. 


Proof. Direct computation. 


Lemma 2.10.9. Everyoa € S;, can be written as a product of disjoint 
cycles, with the only ambiguity being given by Remark 2.10.5 and 
Lemma 2.10.8. 


Proof. Let H be the cyclic subgroup generated by o. Then, in the 
canonical action of S, on N,, if the orbits of H on N,, are C),...,C, 
these sets are the cycles in a decomposition of a into a product of 
cycles, and the action of o on each of C1,...,C; gives the cyclic 
order on that cycle. Furthermore, products of disjoint cycles that 
differ by more than in the statement of the lemma give distinct 
elements of S;,. 


For example, referring to the elements 0, and a2 above, we may 
write 


o1 = (123)(45) = (231)(45) = (312)(45) 
= (123)(54) = (231)(54) = (312)(54) 
= (45)(123) = (45)(231) = (41)(312) 
= (54)(123) = (54)(231) = (54)(312) 


128 An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


and 
og = (1245) = (2451) = (4512) = (5124) 
=(1245)(3):= (245198) = (45.1213) = (51248) 
= (3)(1245) =(3)(245 1) = (3)(4512) = (3)G124). 


By convention, we do not write cycles of length 1, so if some 
integer 7 does not appear in a decomposition of a permutation o into 
a product of disjoint cycles, we understand that 7 lies in a 1-cycle (i), 
ie., that o(i) =i. 

Again we can multiply permutations, remembering that cycle 
notation is encoding the values of a function, and that multiplication 
is composition, again from right to left. For example, given o; and 
a2 as above, we wish to compute 


03 = 020; = (1245)(123)(45). 


Again we perform the computation by following arrows. We first 
compute o3(1): 


(12345) (13 23) (45) 


so we see o3(1) = 4 and we have a partial cycle (14. Since we wish to 
express our result as a product of disjoint cycles, we next compute 
o3(4): 


(1245) (123) (435) 


Thus, 03(4) = 1 and we have the cycle (14) in 03. So next we 
compute 03(2): 


(1245)(12 > 3)(45). 


Thus 03(2) = 3. Then we compute (3): 


¥ \ 
(1 3245) (123) (45) 


Thus 03(3) = 2 and we have another cycle (23) in o3. Thus so far 
we have the product (14)(23) in a3. Now in this case we only have 
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a single element, 5, left over, and 03 is a bijection, so we must have 
o3(5) = 5, but we compute it anyway: 


¥\ 
(124-5) (123) (45) 


Thus, recalling our conventional of not writing 1-cycles, we see 
o3 = (14)(23), and you can check that this agrees with our previ- 
ous computation. Similarly, 0102 = (13)(25), also agreeing with our 
previous computation. 

In this notation, it is easy to find the inverse of a cycle — we just 
reverse the cyclic order, so that we rewrite the cycle from right to 
left. That is, (i; ... i-)~! = (4, ... 4). In our examples we have 


G =60E21) =0132\(45) 
6, 6421201549). 


Finally, we note that, under our convention of not writing 1-cycles, 
the identity element of S$, is (blank space), the empty product. 
Now let us carefully investigate Sj. 


Definition 2.10.10. Let o € S;,. The cycle structure of o is the set 
(repetitions allowed) of lengths of the cycles in a decomposition of 
into a product of disjoint cycles. © 


For example, 0; has cycle structure {3,2}, 02 has cycle structure 
{4,1}, and o3 = 020, has cycle structure {2,2, 1}. 


Lemma 2.10.11. Two elements o and o’ of Sy, are conjugate if and 
only if they have the same cycle structure. 


Proof. Let a be any element of S,,. If 
= Ci ecb ig, Gwe ag lm 
then 
aga' = (a(iq1)... a(itp, ))(a(ia1) ... W(igpg)) . 
has the same cycle structure as o. And if 


= (Gin 5.9, | Gai veins) 223 
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has the same cycle structure as a, then o/ = aca! where a is 
defined by 
a(ii1) = ji,---, @(ttry) = Jar, a(é21) 


= Pigs oy 8 Vig ) apg 


Corollary 2.10.12. 


(a) For any fired value of r, any two r-cycles in S;, are conjugate. 

(b) Ifo is an r-cycle and m is any integer relatively prime to r, then 
a and o™ are conjugate. 

(c) Any element of S, is conjugate to its inverse. 


Proof. This follows directly from Lemma 2.10.12, noting in partic- 
ular that that lemma states that two elements of S,, with the same 
cycle structure are conjugate, regardless of the cyclic order in each 
of the cycles. 


Definition 2.10.13. A 2-cycle r = (ij%2) in S, is called a 
transposition. 0) 


Lemma 2.10.14. 


(a) An r-cycle o in S, can be written as the product of r—1 trans- 
positions. 
(b) Any element of S, can be written as a product of transpositions. 


Proof. 


(a) By induction on r. If r = 1, o is the identity and is the empty 
product. If r = 2, o is itself a transposition. Now assume the 
theorem is true for r — 1, and let o be an r-cycle, o = (i1... iy). 
Direct computation shows 

G= (i1 sas he) = (i172) (zg <a Die) 
Now (i2 ... 7,) isan r—1 cycle, so is a product of r—2 transposi- 
tions, so a is a product of r—1 transpositions. Then by induction 
we are done. 

(a) By Lemma 2.10.9 any o € S, is a product of cycles, and by (a) 
any cycle is a product of transpositions. 


Lemma 2.10.15. Let p be a prime. Let G be a subgroup of Sp such 
that 
(a) the canonical action of G on Np is transitive; or 
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(a’) G contains an element of order p; 
and (b) G contains a transposition. 
Then G = 55: 


Proof. First we show that (a) and (a’) are equivalent. 

To say that the action of G on N, is transitive is to say that N, 
is an orbit of G. Now N, has p elements, so this means that G has 
a subgroup of index p and hence that |G| is divisible by p. Thus G 
has an element of order p. 

Now observe that an element of S;, of order p must be a p-cycle, 
and that the subgroup of S, generated by a p-cycle acts transitively 
on N,. 

Thus G contains a p-cycle og and a transposition T. 

Let us suppose, for the sake of simplicity, that 7 = (12). Now for 
some k, o(1) = of(1) = 2. Again, for the sake of simplicity, let us 
suppose that o = (12... p). Now direct computation shows that 


oro = (12... p(12)(12... pF = (G + DG +2) 


for every j = 0,...,p — 2. Thus G contains the transpositions (1 2), 
(2 3), (34) ..., (p—1 p). But then G also contains the transpositions 


(23)(12)(23) = (13) 
(34)(13)(34) = (14) 


(p—1p)1 p-1)(p-—1p)=(1p) 


i.e., G contains the transpositions (12), (13),..., (1p). 
But then, for any j,k, G contains the transposition 


(Lk)(1j)(1k) = (9). 


In other words, G contains every transposition. But then, by 
Lemma 2.10.14(b), G contains every element of Sp, i.e., G = Sp. 


We observed in Lemma 2.10.14(b), and just used, that every ele- 
ment of S,, can be written as a product of transpositions. But we 
did not claim, and it is certainly not true, that any element of S,, 
can be written as a product of transpositions in a unique (or any- 
thing like a unique) way. For example, the identity element is the 
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empty product, but also, since a transposition is an element of order 
2, the identity is equal to 7? for any transposition 7. More interest- 
ingly, we have (as we also used) that (13) is a transposition, but it is 
also the product of three transpositions (13) = (23)(12)(23). Thus 
we see it is not even the case that when we write an element of S;, 
as a product of transpositions, the number of transpositions in the 
product is well-defined. As we shall soon see, what is well-defined is 
the parity of this number. In order to see this, we introduce another 
quantity. 


Definition 2.10.16. Let o € S, be a permutation. The number of 
inversions in o is the number of pairs of integers (7, 7) such that i < 7 
but o(7) > o(y). ?) 


This number is easiest to read off from the two-line notation for 
a. In the two-line notation, every time we see 


(oraeer ee) 


with a > b, that is an inversion. 

For example, in 0; above, 2 is to the left of 1, 3 is to the left of 
1, and 5 is to the left of 4, so there are 3 inversions in 0}. 

In a2 above, 2 is to the left of 1, 4 is to the left of 3 and 1, 3 is to 
the left of 1, and 5 is to the left of 1, so there are 5 inversions in o9. 


Lemma 2.10.17. Leto € S, be a permutation and suppose that o is 
the product of t transpositions. Then t is even (resp. odd) according 
as the number of inversions in o is even (resp. odd). 


Proof. Of course, the identity element is the product of zero trans- 
positions and has zero inversions, so the lemma is true for it. We 
prove the lemma in general by showing that every time we multiply 
a permutation og by a transposition 7 on the left, the number of 
inversions in o = Too differs from the number of inversions in a9 by 
an odd number. 

Let + = (ab) and suppose that a is in position 7 and 6 is in 
position j in oo (ie., that oo(4) = a,oo(j) = b), with i < j. Let k 
be the number of positions between positions i and 7 for which the 
corresponding entry c is between a and 6. We obtain o from og by 
interchanging a and b. Let us see the effect of this on the number of 
inversions. 
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First suppose that a < b. Note that in this case we have k new 
inversions, since each c is greater than a, another k new inversions, 
since b is greater than each c, and one more new inversion, since b 
is greater than a. Thus in this case the number of inversions in @ is 
2k +1 more than the number of inversions in og. On the other hand, 
ifa > b then a similar argument shows that the number of inversions 
in o is 2k +1 less than the number of inversions in o9. Thus in 
any case the number of inversions changes by an odd number, as 
claimed. 


Given this lemma, the following definition makes sense. 


Definition 2.10.18. Let o € S, be a permutation and suppose that 
o is the product of t transpositions. Then o is even (resp. odd) as 
t is even (resp. odd). Furthermore, sign(c) is defined by sign(a) = 
(-ly. o 


Remark 2.10.19. We observe that sign: S, — {+1} is a 
homomorphism. © 


Up until now, everything we have said is valid for $,, for any 
positive integer n, including n = 1. Now Sj, is the trivial group, 
and there is really nothing more to be said about it, so in this case, 
everything we have said, though true, is superfluous. But henceforth 
we need n > 2 so that S;, has a transposition. 


Lemma 2.10.20. For n > 2 there are exactly two homomorphisms 
yp: Sp, > {+1}. These are the trivial homomorphism y(o) = 1 for 
every 0 € Sy, and the homomorphism y(a) = sign(o). 


Proof. First we observe that, since {+1} is an abelian group, if 0 
and o’ are conjugate elements of S,, then y(c) = y(a’). 

Let 7 = (12), a transposition, and set « = (7), € = +1. Ass 
special case of Lemma 2.10.11, all transpositions are conjugate in S,,, 
so y(T) = € for every transposition +. Now let o € S;, be arbitrary. 


Write o as a product of transpositions 0 = 7 ... %. Then y(c) = 
elt «.%) = O(n) «..+ OR) =e. 


Then we see there are exactly two possibilities: 


(1) ¢ =1, in which case y(c) = 1 for every o € Sy; or 
(2) « =—1, in which case y(o) = (—1)* = sign(c). 
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Definition 2.10.21. For n > 2 the alternating group Aj, is the sub- 
group of S, defined by 

A, = {even permutations in S,}. ) 


Lemma 2.10.22. For n > 2 A, is a normal subgroup of S;, of 
index 2. 


Proof. A, is the kernel of the homomorphism sign: S$, > {+1}, 
and this homomorphism is onto. 


Lemma 2.10.23. Let o € S, be an r-cycle. Then o is even if r is 
odd and a is odd if r is even. In particular, 0 € An if and only if r 
is odd. 


Proof. Immediate from Lemma 2.10.14(a). 


We now want to investigate the structure of Any. 
Lemma 2.10.24. For any n, Ayn is generated by 3-cycles. 


Proof. Since, by definition, any element of A, is a product of pairs 
of 2-cycles, it suffices to show that any product of pairs of 2-cycles 
is a product of 3-cycles. There are only two nontrivial cases: 


(ab)(ac) = (acd), 
(ab)(cd) = (acb)(acd). 


Lemma 2.10.25. Let G be any normal subgroup of An that contains 
a 3-cycle. Then G = Ay. 


Proof. We shall show that G contains every 3-cycle. Then the 
lemma follows immediately from Lemma 2.10.24. 

If n = 2, Ap = {id} is the trivial group. If n = 3, Ag = 
fid, (123), (132)}. 

Suppose n = 4. For simplicity suppose that G contains the 3- 
cycle og = (123). Up to renumbering, any 3-cycle o must be one of 
(123), (132), (124) or (142). Now (123) = oo and (132) = of. 
Also (124) = (142)?. Thus, we need only show that G contains 
(142). But that is true as 


((1.2)(34))(123)((12)(34))~* = (142). 


Suppose n > 5. Here we shall show that every 3-cycle is conju- 
gate to o9 = (123). Let o = (abc) be any 3-cycle in S,,. Note by 
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Lemma 2.10.11 that o is conjugate to 09 in S, (i.e., there is an ele- 
ment of a of S$, with ¢ = acga~') but that is not good enough—we 
want to show that o is conjugate to oo in Ay, (i.e., there is an element 
a of A, with ¢ = acgpa~'). But the proof of Lemma 2.10.11 tells us 
how to proceed. 

Let ap be any element of S,, with ag(1) = a, ao(2) = b, ao(3) = c. 
Note that n > 5, so let d = ag(4) and e = ao(5). Let ay be the 
element of S, with a,(1) = a, a1(2) = b, a1(3) = c, a1(4) = e, 
ay(5) = d, and aj(i) = ao(2), for i > 5. Note that a, = (de)ap (the 
product of the transposition (de) with the permutation ag). Now if 
we write ap as the product of tg transpositions, we see that we can 
write a, as the product of t; = tg +1 transpositions. Hence, ag and 
a, have opposite parity. Let @ be whichever of ag and ay is even. 
Then o = agga™! is conjugate to ap in An. 


Definition 2.10.26. A group G is simple if it has no nontrivial 
proper normal subgroups. © 


Theorem 2.10.27. Forn > 5, Ay is simple. 


Proof. Let G bea nontrivial normal subgroup of A,. We shall show 
that G contains a 3-cycle. Then the theorem follows immediately 
from Lemma 2.10.25. 

Since G is nontrivial, it has an element o of order k > 1. We prove 
the theorem by induction on k. 

Let k = 2. Since o has order 2, when written as a product of 
disjoint cycles, all the cycles have length 2, i.e., they are all trans- 
positions. Since o € A,, there are an even number of them. For 
simplicity, let 


o= (12\(34)a° 


where o’ is the product of the remaining transpositions, if any. 
Case 1: o = (12)(34). Then (and here we crucially use the fact 
that n > 5), o is conjugate to 


(12)(35)((1.2)(3.4))((12)(35))~* = (1.2)(45) 


and then the product 


((12)(34))((1 2)(45)) = (845) 


is a 3-cycle in G. 
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Case 2: 0’ is not the identity. In that case, o is conjugate in A, 
to 


(123)((12)(34)o’)(123)~! = (14)(23)o’ 
and then the product 
((12)(34)o")((14)(23)o") = (13)(24) 


is an element of G, so we may proceed as in Case 1. 

Next let & = 3. Then, when o is written as a product of disjoint 
cycles, o is a product of 3-cycles. If there is only one of them, we are 
done. Assume there are at least two. For simplicity, let 


o = (123)(456)o’. 
(In this case we must have n > 6.) Then 
o* = (132)(465)(0')? is in G, 

and also 

(124)((123)(456)o’)(124)? = (156)(243)o’ is in G, 
so their product 

((132)(465)(o’)?)((156)(243)o’) = (14263) is in G. 

But then 
(12)(35)(14263) ((12)(85))-? = (16524) is in G, 
so their product 
(14263)(16524) = (13)(56) is in G, 


and we are back in the k = 2 case. 

Next let k > 4. If k is composite, and o is an element of G of 
order k, let p be a prime dividing k. Then o*/? is an element of G of 
order p < k, so we are done by induction. Thus it suffices to consider 
the case k = p, a prime. Note p > 5; in particular p is odd. Note in 
this case that o must be a product of p-cycles. 
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For simplicity let 
S02 2. D)e joy nx. Oy 


ie., 01 = (12... p). We concentrate on 04. 
Now a is conjugate in A, to 


((12)(p—1p))(12 ... p)((12)(p—1p))~* = (13456 ... p-2 pp—1 2) 
and then 
(13456...p—2pp—12)(123...p—1p) 
= (246...p—3p35...p—4 p-—2) 


is a p—2 cycle. Performing the analogous operation on 02,...,0;, we 
find that we have an element of G that is a product of (p—2)-cycles, 
and by induction we are done. 


Corollary 2.10.28. Let G be a normal subgroup of S,,n > 5. Then 
G= {ep Ay. Or Sy: 


Proof. Suppose G C A,. Then G is a normal subgroup of Ay, so 
G = {e} or A, by Theorem 2.10.27. 

Suppose G ¢ A,. Let o € G, o ¢ An. Then o? € A,. Thus if 
o7 £e, GN A, # {e}. But the intersection of normal subgroups of 
S, is anormal subgroup of $,,, and hence of A,. Thus GM An = An. 
Thus A, C G. But A, is a subgroup of index 2 of S,, so G = Sy. 

Now suppose o? = e. Then, when written as a product of disjoint 
cycles, they must all be transpositions. Suppose 0 = 7172 ... Tj is 
such a product, with j > 1. For simplicity we may suppose 


o = (12)(34)73 ... 7). 
But then 
(13)((12)(3.4)r3 ... 7)(13)7! = (14)(23)73 ... 7) EG. 
and then 
((12)(3.4)r3 ... 7/)((14)(23)73 ... 7) = (13)(24) €G, 


so again GM A, # {e} and, as above, G = Sy. 

This leaves the case 7 = 1, ie., 9 = 7, is a transposition. But 
all transpositions are conjugate in S,, (Corollary 2.10.12) and every 
element of S,, is a product of transpositions (Lemma 2.10.14(b)), so 
in this case G = S}, as well. 
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2.11 Exercises 


il, 


We have written down the “multiplication tables” for the groups 
Zs and Z in Example 2.1.11, for the group Dg in Example 2.1.14, 
and for the group Qg in Example 2.1.15. Write down the multi- 
plication tables for each of the following groups. While there is 
no preferred order for the elements in a group, in each case use 
the given order. 


(a) (Za,+) = Za = {[0]a, [1], [2]a, [B]a} 

(b) (Ze, +) Ze = {(O]e; (16, [2], (Sle, [4]6, [5]6} 

(c) (Ze x Zo,+) Ze x Zo = {([O]a, [O]2), ([O]2, (12), 
([1]2, [O]2), (12, [1]2)} 

(d) (ZexZ3,+) Ze x Z3 = {([0]2, [0]3), ([O]2, [1]3), ([O]2, [2]s), 
({1]2, [O]3), (12, (13), (La, [2]3) 

(e) (Z3,-) = ZF = {[1)z, [2]7, [8], [Alz, [5]7, [6]7} 

(f) (Zg,-) = Zs = {[1]s, [3]s, [5]s, [7st 

(g) (Zg,-)  Z%5 = {[1]o, [2]o, (Alo, [5]o, [Z]o, [8] } 

(h) (P({a,>}),*} = P({a,b}) = {{ }, ta}, (Of, fa, ob F 

(i) Dg =< a, 8 | at — Lo" = l,aB = Ba-* aS 


(e;0;07 07,0, 08,e°2,07 8}. 


. (a) Show that the groups (Z4,+) and (Zé,-) are isomorphic. 


(b) Show that the groups (Zz x Z2,+), (Zg,-), and (P({a, }), *) 
are isomorphic. 

(c) Show that the groups in (a) and the groups in (b) are not 
isomorphic. 

(d) Show that the groups (Ze, +), (Z2 x Z3, +), (Zz, -), and (Z§, -) 
are isomorphic. 

(e) Show that the groups Dg and Qs are not isomorphic. 


. (a) For any n > 3, let ¢ = exp(27i/n), and let G be the subgroup 


of M2(C) generated by the matrices real and i a: 


Show that G is isomorphic to the dihedral group Don. 

(b) Let G be the subgroup of M2(C) generated by the matri- 
ces i | and Be A Show that G is isomorphic to the 
quaternion group Qs. 
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. Let G be a group and let X be a set. Let 
G* = {functions f: X > G}. 
Define an operation * on G* by 


(f *g)(z) = f(x)g(z) for every x € X. 


Show that (G, *) is a group. 

(Note that if |G| =n and #(X) =k, then |G*| = n*.) 

. As a special case of problem 4, let G = (Za, +) = {[O]e, [1]o}. As 

we have seen, we can identify G* with P(X) = {subsets of X} 

by f <> f71((1]J2). 

(a) Under this identification, if f + A and g +> B, show 
that h = fxg <> C = Ax B. In this way we see that 
(P(X), *) is a group. 

(b) What is the identity element of this group? If A € P(X), 
what is the inverse of A in this group? 

. Let G be a group and let g be a fixed element of G. Define a new 

operation * on G by ax b= agb. 

(a) Show that (G,*) is a group. What is the identity element of 
(G, *)? What is the inverse of a € G in (G, *)? 

(b) Show that the group (G,*) is isomorphic to G. 


. Let X be a set and let f: X > G be a bijection from X to a 
group G. Define an operation *« on X by 


vxy =f (f(x) f(y). 


Show that (X,*) is a group. What is the identity element of this 

group? What is the inverse of « € X in this group? 

(Note that problem 6 is the special case of this problem where 

f: G > G by f(a) =ag, or by f(a) = ga.) 

. Let f: P(X) > P(X) be the bijection f(A) = A°. Then by prob- 

lem 7 we may use this bijection to define a new group structure 

which we shall denote by (P(X), *°). 

(a) Show that A *° B= (Ax B)°. 

(b) What is the identity element of (P(X), *°)? What is the 
inverse of A in (P(X), *°)? 
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(c) Show that y: (P(X), *) > (P(X), *°) given by y(A) = A® is 


a group isomorphism. 


A semigroup (S, *) is a set S which is closed under an associative 
binary operation x. It is a cancellation semigroup if a*b=ax*c 
implies b = c for all a,b,c € S, and also bka = cxa implies b= c 
for all a,b,c € S. Show that a finite cancellation semigroup is a 
group. 

Let (H,*) be an abelian semigroup (i.e., a* b = b* a for all 
a,b € H). Then (H x H,* x *) is a semigroup. Define a relation 
~ on H x H by (a,b) ~ (c,d) ifaxd=b*c. 


(a) Show that ~ is an equivalence relation on H x H. 

(b) Show that G = H x H/ ~ is a group. (Note that if (H,*) = 
(N,+) then G = (Z,+) and if (H,+) = (N,-) then G 
(Q*,-), where Q* denotes the positive rational numbers.) 


Let G be a group and let a and b be elements of G. Show that 
ab and ba are conjugate. 

Let G be a group and let a and 6 be conjugate elements of G. 
Show that a and 6 have the same order. 

Let H be a subgroup of G of finite index k. 


(a) If H is a normal subgroup of G, show that g* € H for every 
geEG. 

(b) Give an example to show that (a) may not be true if H is 
not a normal subgroup of G. 


Let G be a group of odd order. Let g € G, g # e. Show that g is 
not conjugate to g™. 

Let G be a group of order n and let k be an integer relatively 
prime to n. Let y: G > G be the map (g) = g*. 
(a) If G is abelian, show that ¢ is an isomorphism. 


(b) If G is any finite group, show that ¢ is a bijection. 


Let y: G > G by v(g) = g7!. Show that y is a homomorphism 
if and only if G is abelian. 

Let G be a finite group and let H be a proper subgroup of G. 
Show that G is not a union of conjugates of H. 

Let G be a group and let H be a subgroup of G of index 2. Show 
that H is a normal subgroup of G. 


19. 
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Let mo, No, and gg be fixed nonzero integers. Let S be the subset 
of M3(Z) defined by 


Iomion m is divisible by mo, 
s= 0 1 @q n is divisible by no, 
00 1 q is divisible by qo. 


Under what conditions on mo,no and go is S a group under 
matrix multiplication? 

Show that D4, is isomorphic to Do, x Zo if and only if n is odd. 
For r = 1,3,5,7 let G, be the group 


G, =40,8 |e? = 1,8? = 1, p08 Sa’). 


Show that these are four pairwise non-isomorphic groups of order 
16. 
Let y: G; > G2 be a group homomorphism. 


(a) If Hj; is a subgroup of G1, show that y(H;) is a subgroup of 
Go. 

(b) If Nj is a normal subgroup of G and ¢ is onto, show that 
(Nj) is a normal subgroup of G2. 

(c) If Ho is a subgroup of G2, show that y~!(H2) is a subgroup 
of G. 

(d) If No is a normal subgroup of Gz, show that y~'(No) is a 
normal subgroup of Gj. 


Let y: Gy; + G2 be a homomorphism. Let N = Ker(y). If Hy is 
a subgroup of G, let H2 = y(H;1), a subgroup of G2. Show that 
Hp is isomorphic to H,/N NM Ay. 

Let y: Gj G2 bea homomorphism. Let N = Ker(y). If g € G2 
is an element of Im(y), show that y~!(g) is a coset of N in Gy. 
Let y: Gy — G2 be a homomorphism. Let H2 be a subgroup 
of Im(y). Show that Kp <—+ y~!(K2) is a 1-1 correspondence 
between (left or right) cosets Ka of Hz in Im(y) and (left or 
right) cosets of Hy = y~!(H2) in G4. 

Let H and K be subgroups of a group G. An (H, kK )-double coset 
of G is a subset of G of the form 


Hak = {hak|he H,k € Kk} 


for some element a of G. 
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(a) Show that {(H, K)-double cosets} is a partition of G. 
(b) Give an example to show that {(H,K)-double cosets} may 
not all have the same size. 


Let G be a group. Let A be the diagonal of G x G, 
A= {(9,9) € G x G}. 


(a) Show that A is a subgroup of G x G. 

(b) Show that A is a normal subgroup of G x G if and only if G 
is abelian. 

(c) Show that in this case the quotient G x G/A is isomorphic 
to G. 


Let Z(G) = {g € G| gx = xg for every x € G}. Z(G) is called 
the center of G. 


(a) Show that Z(G) is a normal subgroup of G. 
(b) More generally, let H be any subgroup of Z(G). Show that 
A is a normal subgroup of G. 


Let G and H be groups. Show that Z(G x H) = Z(G) x Z(H). 
Let G be a group and let Z(G) be the center of G. 


(a) If G has a normal subgroup N such that Z(N) = {e} and 
Z(G/N) = {e}, show that Z(G) = {e}. 

(b) More generally, show that |Z(G)| divides |Z(V)| |Z(G/N)|. 

(c) Give an example with N a nontrivial proper subgroup of G 
where we have equality in (b), and on example where we do 
not. 


Show that if G is not abelian, then G/Z(G) is not cyclic. 
Let G and H be groups and let y: G + H be a homomorphism. 
Let Ai(y) = {(9, 9(9)) |g € G} CG x A. 


(a) Show that Aj(y) is a group and that ®;: G > Aj(y) 
defined by ®1(g) = (g, y(g)) is an isomorphism. 

(b) Show that A;(y) is a subgroup of G x H. 

(c) Show that A;(v) is a normal subgroup of G x H if and only 
if Im(y) C Z(#). 
In an entirely analogous way, given a homomorphism w: 
H — G we may define Ao(w) = {(w(h),h)|h € H} CGxHA 
and ®2: H — Ao(w) by ®2(h) = (W(h), h). 
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(d) Determine under what conditions on y and 7 is G x H 
isomorphic to a semidirect product Ai(y) x Ag(w), or, 
analogously, to Ai(y) x Ao(w), and under what conditions 
on y and w is G x H isomorphic to the direct product 
Ai(y) x Ao(y). 


33. For a positive integer n, let 


L,,(R) = {A € M,(R) | det(A) 4 0} 
cus) {A € M,(R)|det(A) > 0} 
L,,(R) = {A € M,(R) | det(A) = 1}. 

). 


(a) Find Z(GL,,(R) 
(b) Show that GZ,,(R) is a semidirect product 


GL,n(R) = GL (R) x H 


with H a subgroup isomorphic to Zg, for every n. 
(c) Show that GL,,(R) is a direct product 


GL,(R) = GL*(R) x H 


with H a subgroup isomorphic to Zg, if and only if n is even. 
(d) Show that GL; (R) is a direct product 


GL* (R) = SL,(R) x K 


with K a subgroup isomorphic to Rt = {positive real 
numbers}, for every n. 


34. Let G = M2(C) and let U be the subgroup of G, 


v-fe ee 


Show that G is the union of conjugates of U. 
(Compare problem 17.) 
35. Let H be a subgroup of G. The normalizer Ng(H) is 


No(H) = {9 € G|gHg"' = H} 


(a) Show that Ng(#) is a subgroup of G. (Observe that H C 
Ng(#) and furthermore that H < Ng(H).) 
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(b) Show that there is a 1-1 correspondence between conjugates 
of H and left cosets of Nc(H) in G. Thus # (conjugates of 
H) =(G: No(H)). 


36. The commutator |a, b] of two elements of a group G is the element 
[a,b] = aba~'b~! of G. The commutator subgroup of G is the 
subgroup G’ generated by all commutators of G, i.e., 


G' = {TI lai. bil} 


) Show that G’ is a subgroup of G. 

) Show that G’ is a normal subgroup of G. 
(c) Show that the quotient G/G’" is abelian. 

) Let H be a subgroup of G with G’ C H. Show that H isa 
normal subgroup of G. 
(e) Let N bea normal subgroup of G. Show that G/N is abelian 
if and only if G’ CN. 


37. For a group G, let ®: G > Aut(G) be the map ®(g) = yg, where 
Yq: G > G by q(x) = grg”' for x € G. Such an automorphism 
Yq is called an inner automorphism of G. 


(a) Show that ® is a homomorphism. Hence Inn(G) = Im(®) is a 
subgroup of Aut(G). 

(b) Show that Inn(G) is a normal subgroup of Aut(G). The quo- 
tient Out(G) = Aut(G)/Inn(G) is called the outer automor- 
phism group of G. 


38. Find Inn(G), Aut(G), and Out(G) for G = Dg, Ds, Qs. 
39. For a positive integer n, show that 


Aut(Z") = GL,(Z) = {invertible n-by-n matrices 


with entries in Z}. 


(Here invertible means that the inverse must also have entries in 
Z.) 
40. (a) Let G be an abelian group with elementary divisors 2, 6, 6, 
12, 60, 300, 2100. Find the invariant factors of G. 
(b) Let G be an abelian group with invariant factors 2, 4, 8, 8, 
3, 27, 125. Find the elementary divisors of G. 
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Let M = Z? (written as column vectors) and let N be the sub- 
group of M generated by {21, 22} with 


36 40 
a | 172 = bal 


Find a basis {21,72} of M and positive integers s; and s2 with 
s1 dividing sz such that N has basis {s1 71,5222}. (Then M/N 
is isomorphic to (Z/s1Z) © (Z/s2Z)). 

Let M = Z* (written as column vectors) and let N = {v1,..., vp} 
be a set of k elements of M. Let A be the k-by-k matrix whose 
ith column is uj;,7=1,...,k. 


(a) If det(A) = 0 show that M/N is infinite. 

(b) If det(A) 4 0 show that M/N is a finite group of order 
|det(A)|. 

Let G be a finite abelian group. 


(a) If G has odd order, show that the sum of the elements of G 
is equal to the identity. 

(b) If G is a cyclic group of even order, show that the sum of the 
elements of G is equal to the unique element of G of order 2. 

(c) Let G be a group of even order, and write G = H®K, where 
|H| is a power of 2 and |K| is odd. Show that the sum of the 
elements of G is equal to the identity if and only if H is not 
cyclic. 


Let G be a finite abelian group. Let p be a prime. 


(a) Show that the number of elementary divisors of G that are 
divisible by p is equal to the number of invariant factors of 
G that are divisible by p. 

(b) If this common value is m, show that G has exactly p™ — 1 
elements of order p and that these elements, together with 
the identity, form a subgroup H of G that is isomorphic to 
Zy ® +: ®Zy, where there are m summands. 


Let G act on a set X. For an element x € X, let G, = {g € 
G| g(x) = 2}. 
(a) Prove Lemma 2.8.12: The relation x ~ y if y € Orbit(z) is 


an equivalence relation on X. 
(b) Show that G, and Gy are conjugate if and only if a ~ y. 
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(c) Show that, for any x € X, there is a bijection f: G/G, > 
Orbit(x). 

(d) Conclude that if any two of |G|,|G,|, and #(Orbit(x)) are 
finite, so is the third, and |G|= |G,.|#(Orbit(z)). 


(a) Let G = {symmetries of a regular tetrahedron}. Show 
IG| = 24. 

(b) Let G = {symmetries of a cube}. Show |G| = 48. 

Let p bea prime. Let V = (Z,)”, written as column vectors. Then 
GL,,(Z,p) acts on V by multiplying a matrix times a vector. 


(a) Show that GL,(Z,) acts transitively on {nonzero vectors 
in V}. 

(b) Let e; be the vector in V with first entry 1 and all other 
entries 0. For n > 1, show that G_, is the semidirect product 
of a normal subgroup isomorphic to (Z,)”~+ and a subgroup 
isomorphic to GLn_1(Z,). (For n = 1, Ge, is trivial.) 

(c) Use (b), and induction, to derive a formula for |GL,,(Zp)|. 


Prove Cauchy’s Theorem (Theorem 2.9.1) as follows: Let G have 
order n. Proceed by complete induction on n. 

If |Z(G)| is divisible by p, Z(G) has an element of order p by the 
abelian case of Cauchy’s Theorem (Theorem 2.6.1). 

Suppose that |Z(G)| is not divisible by p. Use the class equation 
(Corollary 2.8.18) to show that G has a proper subgroup H of 
order divisible by p, so that H has an element of order p by the 
inductive hypothesis. 

Let p and q be distinct primes with p < q. 


(a) If G is an abelian group of order pq, count the number of 
elements of G of order 1, p, q, and pq. 

(b) If G is a nonabelian group of order pq, count the number of 
elements of G of order 1, p, q, and pq. 


Let G be a finite group of order divisible by a prime p. For i > 1 
let s; be the number of elements of G of order p’. Show that 
8, =—1 (mod p) and s; = 0 (mod p) for 7 > 2. 

Let G be a finite group and let p be a prime dividing |G|. Let P 
be a p-Sylow subgroup of G. 


(a) Let N be a normal subgroup of G with |N| divisible by p. 
Show that PMN is a p-Sylow subgroup of N. 
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(b) Let H be a subgroup of G with |H| divisible by p. Show that 
gPg-! FH is a p-Sylow subgroup of H for some element g 
of G. 

(c) Let N be a normal subgroup of G with [G: N] not divisible 
by p. Show that N contains every p-Sylow subgroup of G. 

(d) Let N be a normal subgroup of G with |N| a power of p. 
Show that N is contained in every p-Sylow subgroup of G. 


a) Show that a group of order 56 cannot be simple. 

b) Show that a group of order 312 cannot be simple. 

c) Show that a group of order 600 cannot be simple. 

(d) Show that a group of order 1176 cannot be simple. 

Let G be a group of order p*m, p a prime, k > 1, p not dividing 

m. If p* does not divide (m— 1)!, show that G is not simple. 

(a) Let G be a p-group. Let g € G, g # e. Show that g is not 
congruent to g* for any k with k 4 1 (mod p). 

(b) More generally, let G have order n and let k be any integer 
with k” — 1 relatively prime to n. Let g € G, g # e. Show 
that g is not congruent to gh. 

Consider the following elements of S9, written in two-line form: 


ere 


“1 \ 384952167 
123456789 
ag= 
789623154 
123456789 
a3 = 
248719563 


(a) Find aja2, a2a1, A103, A301, A203, A3a2. Write your 
answers in two-line form. 

(b) By counting inversions, find sign(a1), sign(a2), sign(a3). 

(c) Write ay, a2,a3 as products of disjoint cycles. 


Consider the following elements of S9, written as products of 
disjoint cycles: 


By = (153892)(47) 
By = (1387)(259)(46) 
B3 = (148)(237)(569) 
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(a) Find (12, 6261, 2183, 6361, 283, 6301. Write your answers 
as products of disjoint cycles. 

(b) From the cycle structure, find sign((,), sign(G2), sign(S3). 

(c) Write 61, 82, 63 in two-line form. 


(a) Find all possible orders of elements of S,,, for each n = 
Peco 

(b) Find all possible orders of elements of A,, for each n = 
Teel 

(a) Find all subgroups of S4, up to conjugacy. (That is, find one 
representative of each conjugacy class.) Which are normal? 

(b) Find all subgroups of A4, up to conjugacy. (That is, find one 
representative of each conjugacy class.) Which are normal? 

Let G = S,, considered as the group of permutations of N, = 

{1,...,n}. Let My, be a subgroup of S,, with #(M;,) = k. Let 


Hy(M;) = {o € S,|o(m) = m for every m € M;,} 


Show that H)(M;,) is isomorphic to S,_, and that H2(M,) is 
isomorphic to Sp_z~ X Sz. 

Show that two elements of S;, are conjugate if and only if the 
cyclic subgroups of S;, they generate are conjugate. 

Let p be a prime. Show that S, has (p — 2)! p-Sylow subgroups. 
Let p be a prime. Describe the p-Sylow subgroups of S;,2. 

Let H be a subgroup of S,. Suppose that H contains an n-cycle, 
an (n — 1)-cycle, and a 2-cycle. Show that H = S,,. 

Let G = S,, considered as the group of permutations of N, = 
Alpasay tt hs 

Let H be a subgroup of S,, that acts transitively on N,. If |H| > 
n, show that H is not abelian. 


Chapter 3 


Ring Theory 


Having dealt with groups, we now move on to rings. Groups have 
one operation, while rings have two related operations, “addition” 
and “multiplication”. We will begin by studying rings in general, 
but will quickly turn our attention to “commutative rings with 1”. 
Among these is one of the most familiar and important objects in 
mathematics, the integers Z. On the one hand, we will be proving 
some of the most important properties of Z from a ring-theoretic 
viewpoint, and on the other hand, one of our principal concerns will 
be to see whether, and in what degree, these properties generalize. 
The study of the (positive) integers is a domain known as number 
theory. Number theory is a deep and beautiful subject in mathemat- 
ics whose study, historically, long preceded that of ring theory. But as 
an application of our ideas, we will be proving a couple of the (justly) 
most famous theorems of number theory by using results from group 
theory and ring theory. This not only gives a different perspective on 
these results, but demonstrates the essential unity of mathematics. 


3.1 Definition, examples, and basic properties 


We begin by defining rings. 


Definition 3.1.1. A ring R is a set with two operations, addition 
(+) and multiplication (-), such that (R,+) is an abelian group and 
in addition 


(1) R is closed under multiplication, i.e., a-b € R for every a,b € R. 
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(2) Multiplication is associative, i.e., a-(b-c) = (a- b)-c for every 
a,b,cE R. 

(3) Multiplication distributes over addition, ie., a-(b+c) = a-b+a-c 
and (b+c)-a=b-a+c.-a for every a,b,c € R. 
Risa commutative ring if R is a ring and in addition 

(4) Multiplication is commutative, i.e., a-b = b-a for every a,b € R. 
Risa ring with 1 (or ring with identity) if R is a ring and in 
addition 

(5) There is an element 1 4 0 in R such that a-1 = 1-a=a for 
every a € R. 
Ris a commutative ring with 1 (or commutative ring with iden- 
tity) if R is a ring and in addition both properties (4) and (5) 
bold for R. ») 


The trivial ring is R = {0} with 0+ 0 = 0 and 0-0 =0. Any 
other ring is nontrivial. 

Henceforth, we write ab for a- b. 

Here are some basic properties of arithmetic in a ring. 


Lemma 3.1.2. Let R be a ring. 


a) —(—a) =a for everyaeé R. 

b) 0a = a0 = 0 for everyae R. 

c) (—a)b = a(—b) = —(ab) for every a,b € R. 

d) If R is a ring with 1, (—1)a = a(—1) = —a for everya€ R. 
(e) (—a)(—b) = ab for every a,b € R. 


Proof. 


(a) (R,+) is a group. 
(b) 0a + 0 = 0a = (0+ 0)a = 0a + 0a so 0a = 0 by cancellation in 


(R, +). 
(c) 0 = 0b = (a+(—a))b = ab+(—a)b so (—a)b = —(ab) and similarly 
a(—b) = —(ab). 


(d) If R has 1, this is just the special case of (c) with b = 1. 
(e) (—a)(—b) = —(a(—b)) = —(—ab) = ab. 


Remark 3.1.3. It is easy to check (from Lemma 3.1.2(b)) that if 
R is a nontrivial ring, then an identity element for multiplication in 
R must be nonzero. Thus, the condition 1 4 0 in Definition 3.1.1 is 
simply because we do not wish to consider the trivial ring R = {0} 
to be a ring with 1. } 
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We now give a whole bunch of examples. 


Example 3.1.4. 


(a) 
(b) 


(c) 


The integers Z is a commutative ring with 1. 

OZ = {0} is a trivial ring and 1Z = Z. 

For n > 1, nZ is a commutative ring without 1. 

For n > 1, Zn, the integers (mod n), is a commutative ring with 
1. (Z, is a trivial ring.) Note that there is something to check 
here, as the elements of Z,, are equivalence classes. We showed 
in Chapter 1 that (Z,,+) is a group. We need to show that 
multiplication in Z, satisfies the properties in Definition 3.1.1. 
We leave this to the reader. 

For p a prime, Z,, the integers (mod p), is a commutative ring 
with 1. (Of course, this is just a special case of (c).) Also, the 
rationals Q, the real number R, and the complex numbers C are 
all commutative rings with 1. 

Let R be any commutative ring with 1. Then 


R{x] = {polynomials in x with coefficients in R} 


= {ap +ayx+--- + a,x” | a9,a1,.--, An € R} 


with the usual operations of polynomial addition and multipli- 
cation is a commutative ring with 1. 
Let R be any commutative ring with 1. Then 


R|{x]] = {formal power series on x with coefficients in R} 


where a formal power series is an expression UP° 4 a,x". We call 
these “formal” power series because (as opposed to the case of 
polynomials) they do not define functions but rather are just 
expressions. The operations of addition and multiplication are 
the usual ones with power series. R[[z]] is a commutative ring 
with 1. 

Let R be any commutative ring with 1. For any positive integer n, 


M,(R) = {n-by-n matrices with entries in R} 


with the usual operations of matrix addition and multiplication 
is a ring with 1. It is not commutative for n > 1. 
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(h) Let R be a commutative ring with 1. 
{diagonal matrices in M,,(R)} 
is a commutative ring with 1. 
{upper triangular matrices in M,,(R)} 


is a commutative ring with 1 if nm = 1 and a noncommutative 
ring with 1 ifn > 1. 


{strictly upper triangular matrices in M,,(R)} 


is a trivial ring if n = 1, a commutative ring without 1 if n = 2, 
and a noncommutative ring without 1 if n > 2. 
(A strictly upper triangular matrix is an upper triangular matrix 
all of whose diagonal entries are zero.) 

(i) Let G be an abelian group. An endomorphism of G is a group 
homomorphism y: G > G. Then 


End(G) = {endomorphism of G} 
is a noncommutative ring with 1 with operations 
(e+¥)(M =e) +49),  (Hv)(9) = (9). 
(j) Let R be a ring with 1, and let X be any set. Then 
{functions f: X — R} 
is a ring with 1, with operations 
(f + 9)(«) = f(x) + g(x) 
(fg)(«) = f(x)g(@). 


(k) For a ring R, let us denote by Og the zero element of R, and, if 
R has 1, let us denote by 1g the identity element of R. 
Let R and S be rings. Their product T = R x S is the ring 


T ={(r,s)|reR,s€S} 
with operations defined componentwise, i.e., 
(r1, 81) + (ra, 82) = (r1 + 2, 81 + $2), 


(71, $1) (2, 52) = (r112, $182). 
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Then T is a ring with 07 = (0r,0s). If R and S are both rings 
with 1, then T is a ring with 1 with 1p = (1p, 153). ?) 


Let us now look at some conditions on elements of a ring, and 
with those in hand have another look at the above examples, as well 
as introducing new ones. 


Definition 3.1.5. A nonzero element 7 of a ring R is a zero divisor 
if for some nonzero element s of R, rs = 0 or sr = 0. © 


Definition 3.1.6. An integral domain is a commutative ring R with 
1 that has no zero divisors. 0) 


Example 3.1.7. 


(a) The integers Z is an integral domain. 

(b) The ring nZ, for n > 2, has no zero divisors, but is not an integral 
domain, as it does not have 1. 

(c) If p is a prime, then Z, is an integral domain. Also, Q,R, and C 
are integral domains. 

(d) Suppose that n is composite, and let k be a divisor of n, k £ 1, 
n. Then [k]|[n/k] = [0] in Z,, and every zero divisor in Z,, is of 
this form. Thus, for n composite, Z, is not an integral domain. 

(e) If R is an integral domain, then R[x], the ring of polynomials 
in x with coefficients in R, and R[[2]], the ring of formal power 
series in x with coefficients in R, are both integral domains. 

(f) Let R = Q,R, or C and consider the ring M,(R) of n-by-n 
matrices with coefficients in R. Then A € M,,(R) is a zero divisor 
if A is a nonzero matrix that is not invertible. Similarly, in the 
rings 


{diagonal matrices in M,,(R)} 
{upper triangular matrices in M,,(R)} 


every nonzero matrix in that ring that is not invertible is a zero 
divisor, and in the ring 


{strictly upper triangular matrices in M,,(R)} 


every nonzero matrix in that ring is a zero divisor. 

(g) Let R and S be nontrivial rings and let T = R x S. Then every 
element of T of the form (r,0) with r ¥ 0, or of the form (0, s) 
with s £ 0, is a zero divisor in T’. © 


154 An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


Here is a second kind of element, that in a way is antithetical to 
zero divisors. 


Definition 3.1.8. A unit in a ring R with 1 is an element r of R 
such that there exists an element r’ of R with rr’ = r’r = 1. We set 


R* = {units of R}. .) 


Remark 3.1.9. 


(a) In the noncommutative case, if there is an element r’ of R with 
r’r = 1 and an element r” of R with rr” = 1, then r” = Ir” = 
nm =) =" 1 =r", so + 16 a Unit. 

(b) Also in the noncommutative case, it is perfectly possible that 
there exists an element r’ of R with r’r = 1, but there does not 
exist an element r” of R with rr” = 1, and vice versa. .) 


Definition 3.1.10. A field is a commutative ring with 1 such that 
R* = R— {0} (ie., such that every nonzero element of R is a unit). 

A division ring (or skew field) is a ring with 1 such that R* = 
= {0}. v) 


Lemma 3.1.11. Let R be a ring with 1. Ifr is a unit in R, then r 
is not a zero divisor in R. In particular, if R is a field then R is an 
integral domain. 


Proof. Let r bea unit of R, so that there is an element r’ of r with 
rr’ =r'r =1. If rs =0, then 


s= ls =("reH—r rsa 0) —0 
and similarly if sr = 0, then s = 0, so r is not a zero divisor. 


In particular, in a field, every nonzero element of R is a unit, so 
no nonzero element of R can be a zero divisor. 


Lemma 3.1.12. Let R be a ring with 1. Then R* is a group under 
multiplication. 


Proof. We check the group axioms. 
(Closure) Let r; be a unit in R, so that there is an element rj of 
R with rr) =rjri = 1, and let rg be a unit in R, so that there is an 
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element r4 of r with rerg = rgrg = 1. Then 

(rire)(rori) = ri(rerg)r, =ri(1)ry = rir, =1 
and 

(ror. (rire) =ro(rini)re =rh(1)re = rhro = 1 


so T17r9 is a unit in R. 

(Associativity) Multiplication in R is associative. 

(Identity) 1 € R is a unit. 

(Inverse) If r is a unit, and r’r = rr’ = 1, then r— 
unit). 


1_— r’ (also a 


Example 3.1.13. 


(a) As we have observed, Z is an integral domain. Z = {+1}. 

(b) For the ring with 1, Z,, the units in Z, are {[k] € Z, | k 
is relatively prime to n}. We denoted this group by Z* in Chap- 
ter 1, anticipating Definition 3.1.8. 

(c) We have observed that Z,, for p a prime, Q, R, and C are integral 
domains. In fact, they are fields. 

(d) Let R be a commutative ring with 1. We know Riz] = 
{ye sane” | an 6 Bh 


Then Ri{x]* = {ao | ao € R*}. 

In particular, if R is a field, R[x]* = {ao | ao 4 O}. 

We also know R[[x]] = {S52 panz” | an € R}. 

Then R\[x]]* = {UP2 panz” | ao € R*}. 

In particular, if R is a field, R[[x]]* = {UP panx” | ap A O}. 
(Note that difference between these two cases!) 0) 


We don’t always have “cancellation” in rings, but it is very useful 
to note that often we do. 


Lemma 3.1.14. Let R be a ring, and let a € R be any element that 
is not a zero divisor. (In particular, if R is an integral domain this 
is the case for anya € R, a #0.) If ab = ac for some elements b,c 
of R, then b=c, and if ba = ca for some elements of R, then b=c. 


Proof. Suppose ab = ac. Then 0 = ab — ac = a(b—c) and since a 
is not a zero divisor, we must have b — c = 0, i.e., b= c. Similarly if 
ba = ca. 
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In order to obtain new examples of fields, and of integral domains, 
let us carefully examine the familiar construction of the complex 
numbers. 


Example 3.1.15. Let i = /—1. Let 
R(t) = {a + bi | a,b € R}, 


with addition and multiplication defined “as usual”, where i? = —1. 
We recognize that C = R(7). It is easy to see that C is a commutative 
ring with 1. How do we know that C is a field? In order to show this, 
suppose z € C, z £0. Then z = a+ bi with not both of a and b equal 
to 0. Then if z= a— bi, 


zz = (a+ bi)(a — bi) =a? +b? £0, 


z 
-(az) =] 


and z is a unit, with z~! = z/(a*+6*) = a/(a?+6?)+(—b/(a?+02))i. 
Observe that, by the same logic, 


Q(t) = {a+ bt | a,bE Q} 


SO 


is a field. 
Note that we may also form 


Z(i) = {a+ bi | a,b € Z} 


and then Z(7) is an integral domain. Z(¢) is called the ring of Gaussian 
integers. 0) 


Example 3.1.16. Let D #4 0, 1 be squarefree integer, i.e., an integer 
not divisible by any perfect square other than (—1)? = (1)? = 1. Let 
Q(VD) = {a+ bVD | a,be Q} 


with addition and multiplication defined “as usual”, where (VD)? = 
D. It is easy to see that Q(VD) is a commutative ring with 1. In 
order to show that Q(VD) is a field, let z = Q(VD), z 4 0. Then 
z = a+bvVD with not both of a and b equal to 0. Then if Z = a—bVD, 


22 = (a+ bVD)(a — bVD) = a? — BD. 


We claim that a?—b?D # 0. To see this, first observe that if D 4 0, 
1 is squarefree, then D cannot be a perfect square, i.e., cannot be 
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the square of an integer. But then a? — b?D = 0 is equivalent to 
D = b?/a? = (b/a)?, and it is easy to check that if D is not the 
square of an integer, it cannot be the square of a rational number 


either. Thus, 
z 
: (a _ x5) =! 


and z is a unit, with z~! = z/(a? —b?D) = a/(a* —b*D)+4 (—b/(a? 
b2D))VD. 

(Note how this exactly generalizes the construction of Q(z) in 
Example 3.1.15. We obtain Q(i) as Q(VD) for D = —1.) 


Now we form 
{a+bvVD | a,b both integers} if D = 2,3 (mod 4) 
OW/D)= {a+bvVD | a,b both integers or both half-integers} 
if D=1 (mod 4) 


where a half-integer is a rational number c/2 with c an odd integer. 
(Thus, a half-integer is not an integer, but twice it is.) We leave it 
to the reader to show that O(VD) is an integral domain. (The hard 
part of this is to show that O(VD) is closed under multiplication 
in the case D = 1 (mod 4).) O(VD) is called the ring of algebraic 
integers in Q(VD). 

(Again, this is a generalization of the construction of Z(i) in 
Example 3.1.15. We obtain Z(i) as O(WD) for D = —-1.) 0) 


Before we go on, there is a subtle difference between rings and 
rings with 1 that we need to address. It occurs in two forms. 


Definition 3.1.17. 


(a) Let R bearing. A subset S of R is a subring of R if S is a ring, 
with the same operations as R. 
(b) Let R be a ring with 1. A subset S of R is a sub(ring with 1) 
of R if S is a ring with 1, with the same operations as R, and 
ls = 1p. 0) 
Definition 3.1.18. 
(a) Let Rand S be rings. A homomorphism of rings (or ring homo- 
morphism) vy: R—- S is a function satisfying 
y(ri +72) = (ri) + v(ra), y(rira) = (ri)y(r2) 
for all ry,ro € R. 
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(b) Let R and S be rings with 1. A homomorphism of rings with 
1 (or ring with 1 homomorphism) is a homomorphism of rings 
yp: R-> S with y(1pR) = 1g. 

(c) As a special case of (b), if R and S are fields a homomorphism 
of fields (or field homomorphism) is a homomorphism of rings 
yp: R-> S with y(1r) = 1g. 

(d) In either case, an invertible homomorphism is an isomorphism. If 
there is an isomorphism of rings y: R > S, or an isomorphism of 
rings with 1, or of fields, y: R > S, then R and S are isomorphic 
rings, or isomorphic rings with 1, or isomorphic fields. © 


Example 3.1.19. 


(a) The trivial ring R = {0} is a subring of every ring, but is not a 
sub (ring with 1) of any ring with 1. 

(b) For n > 1, nZ is a subgroup of the ring Z, but is not a sub (ring 
with 1) of the ring with 1 Z. 

(c) Q is a subfield of R and R is a subfield of C. O(VD) is a sub 
(ring with 1) of the ring with 1 Q(VD). 

(d) Let R and S be rings with 1 and let T = Rx S. Then T has 
a subring Ro = {(r,0) | r € R} and there is an isomorphism of 
rings with 1 yi: R — Ro by yi(r) = (r,0). Similarly T has a 
subring So = {(0,s) | s € S} and there is an isomorphism of rings 
with 1 yo: S > Spo by ye(s) = (0,5). But, although T is a ring 
with 1 and Rp and Sp are each rings with 1, Ro and Sg are not 
sub(rings with 1) of T, as lp = (1p, 1g) while 1p, = (1r,0) 4 17 
and 1s, = (0, 1s) f lr. 

Lemma 3.1.20. Let S be a subring of the ring R (resp. let S be a 


sub (ring with 1) of the ring with 1 R. Then S is the image of a 
homomorphism of rings (resp. a homomorphism of rings with 1). 


Proof. In either case S is the image of the inclusion 7: S > R. 
Here are some examples of ring homomorphisms. 
Example 3.1.21. 


(a) For any positive integer n, we saw in Chapter 2 that we have a 
homomorphism of groups y: Z > Z, given by y(k) = [k]n (=k 
(mod n)). As we have seen here, both Z and Z,, are rings, and 
it is easy to check that y is a homomorphism of rings (and a 
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homomorphism of rings with 1, except when n = 1, in which 
case Z,, is a trivial ring). 

(b) Similarly, for any m we have a ring homomorphism y: Zmn, > Zn 
given by y([k]mn) = []n- 

(c) Now let m and n be any two relatively prime positive integers. 
Then we have a homomorphism of rings yg: Zmn > Z x Zp given 
by g([A]limn) = ([k]m; [A]n), and the proof of Theorem 2.2.15 shows 
that y is an isomorphism of rings, and an isomorphism of rings 
with 1 providing m,n > 1. 

(d) Consider the polynomial ring R[x]. We have been regarding poly- 

nomials as expressions, but we can also regard them as func- 

tions. If f(x) = ag t+ aya + --- + a,x", then for any r € R. 

f(r) = ao t+ ayr + +++ + a,r”. Then for any fixed element 

ro of R, we have the homomorphism y: R[x] + R given by 

o(f(x)) = f(ro)- 

Let R = Q(VD) as in Example 3.1.16. Then we have a field 

isomorphism y: R + R given by y(z) = Z, where, if z = a+bV/D, 

Z=a—bVD. © 


We use the same language for ring homomorphisms as we do for 
group homomorphisms. 


— 
oO 
ae 


Definition 3.1.22. Let y: R > S be a homomorphism of rings, or 
a homomorphism of rings with 1. Then 


Ker(y) = {r € R | y(r) = 0}, 
Im(y) = {s € S | s= y(r) for some r € R}. .) 
For general rings, or rings with 1, a homomorphism may be injec- 


tive, surjective, both, or neither. But for fields the situation is more 
restrictive. 


Lemma 3.1.23. Let R and S' be fields and let py: R— S be a field 
homomorphism. Then ~ is injective. Consequently, py is an isomor- 
phism if and only if it is surjective. 


Proof. Suppose that r € R,r #40. Then r has an inverse r~!, with 
rr-' = 1. But then 1 = y(1) = y(rr—') = y(r)y(r—") so y(r) £ 0. 
Consequently, Ker(y) = {0} and y is injective. 


Example 3.1.24. Our restriction that D be a squarefree integer, 
D #¥ 0,1, in constructing Q(VD) in Example 3.1.16 was just to 
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eliminate duplication. For suppose D’ = e?.D for some nonzero ratio- 
nal number e. Then we have an isomorphism, y: Q(VD’) + Q(VD) 
given by y(a+bVD’) = at+bevVD. However, this would have changed 
the ring of algebraic integers. The choice we made there turns out to 
be the right choice. © 


3.2 Ideals in rings 


While subgroups are very important in studying groups, it turns 
out that it is not subrings, but rather ideals, that play a key role 
in studying rings. We introduce ideals in this section, and they will 
reappear throughout this chapter. 


Definition 3.2.1. Let I be a subset of the ring R. If (J,+) is a 
subgroup of (R,+) and 


(1) ri € I for every r€ R, ie I then I is a left ideal of R, 

(2) ir € I for every r€ R, ie I then I is a right ideal of R, 

(3) ri € I and ir € I for every r € R,i € I, then I is a two-sided 
ideal, or simply an ideal, of R. © 


We will usually just use the term ideal, but we will occasionally 
use the term two-sided ideal for emphasis. Of course, if R is com- 
mutative, the notions of left ideal, right ideal, and two-sided ideal 
coincide, and in any case a two-sided ideal is both a left ideal and a 
right ideal. 

Note that the condition of being an ideal is stronger than the 
condition of being a subring. A subring must be closed under mul- 
tiplication of any two of its elements, while an ideal must be closed 
under multiplication of any of its elements by any element of the 
ring. 


Example 3.2.2. 


(a) I = {0} is an ideal, the trivial ideal, of any ring R. Any other 
ideal of R is nontrivial. 

(b) J = R is an ideal, the improper ideal, of any ring R. Any other 
ideal of R is proper. © 


Here are some more interesting examples. 
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Example 3.2.3. 


(a) For any integer n, nZ is an ideal in Z. 
(b) Let R = Z[x], the ring of polynomials in x with coefficients in Z. 
We have the following ideals in R: 


I, = {f(z) =ant+ --- +42” | ap = OF, 
Ig = {f (x) = ap + +++ +2” | a; is even for every 1}, 
Iz = {f(x) = a0 + +++ +an2” | ao is even}. 


(c) Let R = Z, and let k be any integer dividing n. Then 
I = {[0], [A], ...,[(n/& — 1)k]} is an ideal of R. 

(d) Let R be a ring with 1 and let X be any set. Let S = 
{functions f: X — R} as in Example 3.1.4 (7). 
For any zp € X, 


T={f:X + R| f (eo) = 0}. 
is an ideal of S. More generally, for any subset Xo of X, 
IT={f:X > R| f(xo) =0 for every x € Xo} 


is an ideal of S. 

(e) Let T = Rx S as in Example 3.1.4 (k). Then R x {0g} and 
{Or} x S are ideals in T. 

(f) Let R be a commutative ring with 1. Then 


{A € M,,(R) | the first column of A is 0} is a left ideal of 
M,(R), and 


{A € M,,(R) | the first row of A is 0} is a right ideal of M,,(R). 
(g) Let R = {upper triangular matrices in M,,(R)}. Then 
I = {strictly upper triangular matrices in M,,(R)} 
is a two-sided ideal of R. 0) 
In fact, we have a sharper result than Example 3.2.3(a). 


Lemma 3.2.4. Let I be an ideal of Z. Then I = nZ for some inte- 
gern. 
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Proof. Note that (J,+) is a subgroup of (Z,+) and we found all 
subgroups of (Z,+) in Lemma 2.3.8: They are all given by nZ, for 
some integer n. Now, while it is certainly not the case that every 
additive subgroup of (R,+) is an ideal of R, that is the case here: 
If m is any integer and n’ is any element of nZ, then n’ = nk for 
some integer k, and so mn’ = n(mk) € nZ, and hence nZ is an ideal 
of Z. 


Here is a result that makes it slightly easier to check that a subset 
of a ring is an ideal. 


Lemma 3.2.5. Let I be a subset of the ring R with 1. Then the 
condition that (I,+) is a subgroup of (R,+) in Definition 3.2.1 may 
be replaced with the condition that I is closed under addition. 


Proof. Suppose that J is closed under addition. We have to verify 
that (I,+) is a subgroup of (R,+). Now for anyie J,0=O0iEe I 
or 0 = 10 € I, or 0 = 0i = 10 € I, by Lemma 3.1.2(b), so I has 
the identity 0 for addition. Furthermore, if R is a ring with 1, then 
if i € I, —i = (-1)t or —7 = i(—-1) or —i = (-1)i = i(-1) € T by 
Lemma 3.1.2(d), so I has the additive inverse of any element of I, so 
(J, +) is a group. 


Here is an easy, but useful, observation. 


Lemma 3.2.6. Let R be a ring with 1, and let I be a left/right/two- 
sided ideal of R. Then I = R if and only if 1 € I. 


Proof. Suppose J is a left ideal. Certainly if J = R then 1 € J. On 
the other hand, suppose 1 € J. Then for every r € R, r=rl1€/TI, so 
I = R. Similarly for a right ideal. And a two-sided ideal is both. 


Remark 3.2.7. Let R be a ring with 1, and let J be a proper ideal 
of R. Then we observe from Remark 3.2.2 and Lemma 3.2.6 that I 
is a subring of R, but not a sub(ring with 1) of R. % 


For the sake of simplicity, we will state the next result only in the 
commutative case. A similar, though definitely not identical, result 
holds in the noncommutative case. 


Lemma 3.2.8. Let R be a commutative ring with 1. Then R is a 
field if and only if the only ideals of R are {0} and R. 
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Proof. Suppose that R is a field. Let I be an ideal of R. If I = {0}, 
there is nothing to prove. If I £ {0}, let i be an element of I, i £ 0. 
Then 1 =i~'t € J, and so, by Lemma 3.2.6, I = R. 

On the other hand, suppose that R is not a field. Let rg € R with 
ro #0 and ro not a unit. Consider I = {rro | r € R}. Then J is an 
ideal of R. Now I 4 {0} asrp = 179 € I. Also, TA Rasifl=R 
then 1 € J sol=ryro =7ror, for some r; € R, in which case 79 is a 
unit, contrary to our hypothesis. 


For the sake of simplicity we will state the next few results just 
for ideals, although they are also valid for left ideals and right ideals. 


Lemma 3.2.9. Let R be a ring and let I and J be ideals of R. Then 


(a) IN J is an ideal of R, 

(b) f+ J={i+ 7 |ie1,j € J} is an ideal of R, 

(c) IJ = {UP _jinge | te € Ign € J fork =1,...,n} is an ideal 
of R. 


Proof. We leave this as an exercise for the reader. 


Remark 3.2.10. Note that, unless J C J or J CJ, TU J is not an 
ideal of R. 0) 


Lemma 3.2.11. Let R be a ring and let Ty, Io, ... be ideals of R with 
hCbhC.... Then I =UlI; is an ideal of R. 


Proof. Let r},r2 ¢ J. Then r; € J;, for some j; and rp € I for 
some jg. Let jg = max(j1,j2). Then r1,r2 € Jj, (as Ij, © Ij, and 
Ty © Jj.) 8071 +72 € Lj, C I. Also, let r € I. Then r € I,, for some 
ji. For any s € R, sr € Tj, CI, and rs € I;, C J. Then, by Lemma 
3.2.5, I is an ideal of R. 


We have an important (rather abstract-looking) condition on a 
ring. 


Definition 3.2.12. Let R be a ring. Then R satisfies the ascending 
chain condition (ACC), or is a Noetherian ring if every sequence of 
ideals 7 C Ig C Ig C ... is eventually constant, i.e., if there is some 
jo such that Ij, = Tjyp41 = Tjpt2 = ..., or, otherwise said, if every 
sequence of ideals I, C Ig C Ig C ... is finite. © 
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Here is a more concrete-looking equivalent condition. We restrict 
our selves to rings with 1 to simplify the next definition. 


Definition 3.2.13. Let R be a ring with 1 and let J be an ideal of 
R. A set {ix} of elements, of I generates I if every element i of J can 
be written as 


i= y ThRSk for some rz, 5% € R. 


In this case we write I =< {iz} >. 

(By convention, the empty sum is equal to 0, so J = {0} is gen- 
erated by the empty set.) 

If J has a finite generating set then J is called finitely 
generated. v) 


Note that, even if {i,} is infinite, any sum in this definition is 
(implicitly) a finite sum — infinite sums do not make sense. If {i} 
consists of a single element, we will often say that i (rather than {7}) 
generates I. 


Theorem 3.2.14. Let R be a ring with 1. Then R is Noetherian if 
and only if every ideal of R is finitely generated. 


Proof. Suppose that every ideal of R is finitely generated. Consider 
a chain J; C Ip C ... of ideals of R, and let J = U;J;. I is an ideal of 
R, by Lemma 3.2.11. Let {ix},=1,... n be a finite set of generators of J. 
Then each iz is in (at least) one of the ideals in the union, say i, € J;,. 
Let jmax = max(j1, ---, Jn). Then iz € I;,,,,, for every k=1,..., n, 
so in particular the ideal generated by {ix},=1,....n is contained in 


Ijnax: But this ideal is just I. Thus we see Ij... © Ujmaxti G'S 
IC Ijin, 80 we must have Jj. = Ljna,tl =" = 

Conversely, suppose that R has an ideal I that is not finitely 
generated. We construct an infinite chain of ideals Ij C Ig C ... as 
follows: 

Let 71 be any nonzero element of J and let I, be the ideal generated 
by {t1}- 


Let 72 be any element of J that is not in J, and let I, be the ideal 
generated by {i1, ig}. 

Let 73 be any element of J that is not in Iz and let [3 be the ideal 
generated by {71, i2, 73}. 
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Continue, and note that we can go on forever, as if we had 
to stop at some stage n, I would be generated by the finite set 


Vids cs <9 tat 
Example 3.2.15. 


(a) Any field R is a Noetherian ring, as it only has two ideals {0} 
(generated by { }) and R (generated by {1}). 

(b) We saw in Lemma 3.2.5 that every ideal of Z is of the form 
I = nZ for some integer n. If n = 0, I = {0}, generated by { }, 
and if n £ 0, I is generated by {n}, so Z is Noetherian. 

(c) We shall see that if R is a field, then any nonzero ideal J of R[x] 
is generated by {p(x)} for some nonzero polynomial p(x), so R[z] 
is Noetherian. 

(d) We shall see in Corollary 3.3.14 that, for any positive integer 


n, the ring Z[x,,..., 2] of polynomials in the (commuting, 
as usual) variables x1, ..., 2, with coefficients in Z is Noethe- 
rian, and also that for any field R, the ring R[x, ..., 2p] is 
Noetherian. 6) 


Remark 3.2.16. Observe in Example 3.2.15 (a), (b), and (c) every 
nonzero ideal was generated by a single element. This turns out to 
be a crucially important property which we will have a lot to say 
about below. 0) 


Here, by contrast, are some non-Noetherian rings. 
Example 3.2.17. 


(a) Let R bea field and consider the ring R[x1, x2, ...] where we have 
a variable x; for every positive integer 7. Thus R[xz1,x2,...] isa 
polynomial ring in infinitely many variables. Let J be the ideal of 
R{x1, £2, ...] consisting of polynomials whose constant term in 
zero. (Note that I is generated by the infinite set {x1, 22, ...}.) 
We claim that J is not finitely generated. Suppose it was, that 
I had a finite generating set {p1,..., pe}. Now each p; is a 
polynomial, so is an expression in only finitely many variables. 
Thus only finitely many variables appear in expressions in the 
finite set {p1, ..., px}. Choose x; to be a variable that does not 
appear. Then x; € J, but x; is not in the ideal generated by 
{pi, «+1 Pe}. 
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(b) 
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Let R= Qand Ro = Z. Let S be the ring 
S = {p(x) € R[x] | the constant term of p(x) is in Ro}. 


Choose any nonzero element ro € Ro that is not a unit in Ro. 
(For example, we could choose rp = 2.) Let J; be the ideal of S 
generated by (ro)~‘a for each i = 0,1,2,.... Then we have an 
infinite chain Jg Cy Clg Cc ..., and so S is not Noetherian. 
Let R be a field and let S' be the ring of “polynomials in pos- 
itive rational exponents of x with coefficients in R”. That is, 
an element of S is an expression f(x) = ag + U%_,a;2% where 
a0,@1,---, Gn € Rand q is a positive rational number. We define 
addition and multiplication in S “as usual” (where, “as usual”; 
gig? — pita), 


Let J; be the ideal of S generated by «1/ 2' Then we have an 


infinite chain Jo C 4 C Ig C ..., and so S is not Noetherian. 
(Observe that Ip7 Uy U Ig U ... = I, where I is the ideal of S 
consisting of those f(z) with constant term ag = 0.) % 


Ideals are important for many reasons (as we shall see), but here 


is one important one. 


Lemma 3.2.18. Let R and S be rings and let py: R— S be a ring 
homomorphism. Then Ker(y) is an ideal of R. 


Proof. First observe that if yg is a ring homomorphism, it is a 
homomorphism of the additive groups Q = (R,+) — (S,+). Then 


we 


know that Ker(y) is a subgroup of (R, +). Indeed, we know that it 


is anormal subgroup, but since (R, +) is an abelian group, subgroups 
and normal subgroups coincide. 


Also, let i € Ker(y) and let r € R. Then 


y(ri) = y(r)y(t) = y(r)0=0 so rie Ker(y) 


and 


and hence we conclude that Ker(y) is an ideal 


y(ir) = p(i)y(r) = Oy(r) =0 so ir € Ker(y) 


Now, forgetting about multiplication for the moment, R is an 


abelian group under addition and an ideal J of R is a subgroup, so 
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we may form the quotient group R/I. But of course we are working 
with rings, and we would like R/J to be a ring. Indeed, it is, as we 
now see. 

Let us recall that the elements of R/I are cosets of J, that we can 
write any coset of J as r+J for some r € R, and that two cosets 
r+TJ and r’+T are the same, or equivalently, that r and r’ are 
representatives of the same coset, if and only if r’ = r +i for some 
element i of I. (Here I have deliberately written cosets rather than 
left cosets to remind you that we are in the abelian case.) 


Theorem 3.2.19. Let R be a ring and let I be an ideal of R. Then 
R/I is a ring with multiplication given by 


(ry + I)(r2 +I) =ryrot+]. 


Furthermore, the map 7: R > R/I given by r(r) =r+T is a ring 
homomorphism. 


If R is a ring with 1 and I is a proper ideal of R, then R/I is a 
ring with 1 and x is a homomorphism of rings with 1. 


Proof. The key thing to check is that multiplication is well-defined, 
i.e., independent of our choice of coset representatives. So suppose 
rmot+2=rn4+/ and rh+JI =1ro+T]. We have to show that (ri, + 
D (ry +1) = (ri + Dra + D. 
Now we know that rj = 7r1+i1 and rf = rg+% for some 4}, ig € I. 
We compute 


rr = (r1 + t1)(re t+ ta) = rire + rite t+ tire + trig 


=1r1rT2 +13 where 13 = 1T1l9 +2172 +2422 


and we observe that 73 € J precisely because J is an ideal. Then we 
see that 


rr +l =ryrot+] 


(ae) = ie ed) 


as required. 
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We leave it to the reader to check the remaining ring axioms, and 
to check that 7 is a ring homomorphism. 

In the case of rings with 1 there is a little more work to do. 
Recall that the trivial ring is not a ring with 1, so we cannot have 
R/I = {0}, ie., we cannot have J = R, i.e., J must be a proper 
ideal of R. Conversely, if J is a proper ideal of R, we can form R/T, 
and then check that 1+ J is the multiplicative identity in R/I. But, 
remembering that J = 0+ I is the zero element of R/I, we need 
1+I4I,ie., 1¢ J. That must be the case as if we had 1 € J, then 
by Lemma 3.2.7, we would have J = R, which it is not. Given this, 
we also see in this case that 7(1) = 1+, which we just observed 
was the multiplicative identity in R/I, so 7 is a homomorphism of 
rings with 1. 


Definition 3.2.20. The ring (or ring with 1) R/I of Theorem 3.2.19 
is the quotient of R by I, and the homomorphism 7: R > R/T is the 
quotient map (or canonical projection). © 


Remark 3.2.21. If 7: R > R/I is the quotient map, then 7 is an 
epimorphism and Ker(z) = J. .) 


There is one thing that may have bothered you a bit. When 
we formed R/I, we formed the quotient of additive groups, i.e., we 
formed the quotient of R under the equivalence relation that two ele- 
ments r and r’ are equivalent if r’ = r +7 for some 7 € I. Shouldn’t 
we have taken the multiplication on R into account as well? The 
answer is that we did. We did not try to take R/J for I simply a 
subgroup of the additive group of R, but rather required J to be an 
ideal, and the requirements for J to be an ideal involve the multi- 
plication on R. You can see from the proof of Theorem 3.2.19 that 
these requirements were what we needed for multiplication in R/I 
to be defined (i.e., these requirements were precisely what we needed 
to ensure that, in the notation of that proof, 73 € I). 


Example 3.2.22. 


(a) For n > 0, the ring Z,, is the quotient of the ring Z by the ideal 
nZ. Note that the ideal nZ is generated by {n}. 
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(b) Let R= Z[z] as in Example 3.2.4 (b). Then we have an isomor- 
phism 9; = R/I, > Z given by y(f(x)) = ao. Note that the 
ideal I, is generated by {2}. 

We have an isomorphism yo: R/I_g > Ze[x] given by yo(f(x)) = 
[ao]2 + --- + [an]ox". Note that the ideal Jy is generated by {2}. 
We have an isomorphism v3: R/Iz3 — Za given by y(f(x)) = 
[ao]2. Note that the ideal Iz is generated by {2, x}. » 


When we derived the Chinese remainder theorem (Corollary 
2.2.17) in Chapter 2, we mentioned that it is best understood in 
terms of ring theory. We now derive that here (in a more generalized 
form). 


Definition 3.2.23. Let R be a commutative ring with 1. Two ideals 
I and J of R are coprime if 1+ J = R. © 


Example 3.2.24. Let R = Z and let m and n be relatively prime 
integers. Then there are integers x and y with mz + ny = 1. Now if 
I =mZ and J = nZ, then mz € I and ny € J, sol € 14+ J and 
hence J+ J = R. Thus, we see that if m and n are relatively prime 
integers, the ideals mZ and nZ are coprime ideals of Z. 

Also note that if m and n are relatively prime, 


mZO nZ = {integers divisible by both m and n} 


= {integers divisible by mn} = mnZ. 0) 
Lemma 3.2.25. Let R be a commutative ring with 1 and let I 
and Jy,..., JIm be ideals of R with I and J, coprime for each 
k=1,...,m. Then I and J= J, +--+ NJm are coprime. 


Proof. For each value of k, since J and J}, are coprime, there is an 
element 7; of J and an element 7; of J, with 7, + 3, = 1. Then 


1 = (41 + ji) (ta + ja)... (im +m) =t4+ 9 


where 7 = J1,---, Jm and 7 is the sum of the other terms in the 
product. Now every term in 7 has at least one i, factor, soi € J, 
and 7 has a jz factor for every k, soj7 EC J= 1M... AV Jm. Thus, 
1el+J and hence J+ J = R. 


Theorem 3.2.26 (Chinese remainder theorem). Let R be a 
commutative ring with 1 and let I1,..., In be pairwise coprime 
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ideals of R (i.e., Ip, and Ip, are coprime whenever ky # ko). Let 
T1,---,Tn be arbitrary elements of R. Then the system of simulta- 
neous congruences 


x =r, (mod I) 


x =rg (mod Iz) 


L=Tp (mod I,) 


has a solution x = b in R, and x = U' is a solution if and only if 
b' = b (mod NN --- NIp). 


Proof. In Lemma 3.2.25, lett J = and J = bh,Jg = 
T3,...,Jn—1 = In. Then and J = J7 MN... M Jn_1 are coprime, 
so write 1 =7i+ 7 with 7 and j as in that lemma. Since 7 € J, 7 € Jy 
for each k = 1,...,n—1, i.e, 7 =O (mod Jz) for each such k. Also, 
i €TI,i.e., i= 0 (mod J,), so the equation 1 = i+ 7 gives 


l=i+7=0+7=3 (mod i). 


Set hy =i. Then hy = 1 (mod J;), hy = 0 (mod Ix) for k £1. 

Now repeat the same process with J = Jo and 4 = ,Jo = 
I3,...,Jn—1 = In to obtain an element hz of R with hg = 1 (mod 
In), hg = 0 (mod Ix) for k # 2. Keep repeating the process for 
I =Ts,..., 1 =I, to obtain elements hy, ..., hn of R with hy = 1 
(mod J,) and hy =0 (mod Jy’) whenever k’ 4 k. Now set 


b= - Arr: 
k=1 


Then 0 is a solution of this system of congruences. 
Now if b) = b (mod .N--- NI,), then, as 1 --- VIn C Ty for 
each k, b! = b (mod J) for each k, so 0! is also a solution. On the 
other hand, if b’ is any solution, then b’ = b (mod JI;) for each k, ice., 
b'—b € Ix for each k, in which case b' —bE TI, N +--+ NIn, ice., D' =b 
(mod 1, --+ Nn). 
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Corollary 3.2.27. Let my, ..., Mn be pairwise relatively prime inte- 
gers, and letr,, ..., Tn be arbitrary integers. Then there is an integer 
b satisfying the system of simultaneous congruences 


b=r, (mod m)) 


b=r2 (mod m2) 


b=Tpn (mod m,), 


and an integer b' is a solution of this system if and only if b! = b 
(m0d Wij, s<.5 Mig) 


Proof. From Example 3.2.24, we see that this is just the special 
case of the Chinese remainder theorem when R = Z. 


Remark 3.2.28. We will see later (Example 3.7.5) that there is an 
effective method for obtaining the ring elements h1,..., hy in the 
proof of the Chinese remainder theorem for many rings, including 
in particular R = Z, and so there is an effective method for solving 
simultaneous congruences in these cases. 0) 


We have stated Theorem 3.2.26 in terms of ideals, but we can 
restate it in terms of quotients. 


Theorem 3.2.29 (Chinese remainder theorem). Let R be a 
commutative ring and let I1,..., In be pairwise coprime ideals of 
R. Then we have a ring isomorphism 


@: R/(IyN «++ A In) 3 (R/T) x (B/Ib) x... (R/In)- 


Proof. Let (r1+h,ret+l, ..., Tn+In) be any element of (R/I,) x 
--» x (R/I,). Then by Theorem 3.2.26 there is a element b of R with 
(64+ ,b+],,...,b+In) = (r+ h,reat+ hy, ..-, 1m +In). In other 
words, if we define y: R > (R/Ih) x (R/In) x +++ x (R/In) by 


g(a) =(t+h,e+h,...,0+In) 
Then y is onto. What is Ker(y)? Certainly 0 € Ker(y). But then 


by Theorem 3.2.26 again Ker(y) = {b' € R |) = 0 (mod nN 
+ O1,)} =hN--- AIn, so (R/J,) x +++ (R/In) is isomorphic to 
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R/Ker(y) = R/(41.9--- O1,). Indeed, tracing through the definition 
of the quotient we see that the isomorphism ¢ is given by 


P(et+hn--- Am) =(e¢+h,...,¢+In). 


We have previously proven that the ring Z,,, is isomorphic to the 
ring Zm X Zp, in case m and n are relatively prime. That proof was 
indeed correct, but here is the “right” proof of that result, which we 
state here for an arbitrary number of factors, not just two. 


Corollary 3.2.30. Let my, ..., mn be pairwise relatively prime inte- 
gers. Then we have a ring isomorphism 


Blin wie, “DS Dig vee BD, 


Proof. This corollary follows from Theorem 3.2.29 in the same way 
that Corollary 3.2.27 followed from Theorem 3.2.26. 


3.3. The integers, and rings of polynomials 


In this section, we want to look at some of the basic properties of Z, 
as well as basic properties of polynomial rings. 


Theorem 3.3.1 (The division algorithm in Z). Let a € Z and 
letbe€ Z with b#0. Then there are integers q and r such that 


a=bq+r with bl: 


Proof. We shall prove this in case a > 0 and 6 > O. The other 
cases can be reduced to this case, and we leave them as an exercise. 

We prove this by complete induction on a. 

If 0 < a < b then we have a = (0+aso0 a= bq+r with q = 0 
and r = a, and by our assumption on a, 0 < r < |b]. 

Suppose that a > 6b and the theorem is true for all nonnegative 
integers < a. Since a > b, a— b > 0, so by the inductive hypothesis 
a—b= bd +r for some integer g’ and some r with 0 < r < |b|. But 
then a = bg +r with q = qd +1 and r unchanged, so we still have 
0<r< |b|. Thus, the theorem is true for a as well. 

Then by induction we are done. 
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Now we turn our attention to polynomial rings. 


Definition 3.3.2. Let R be a commutative ring with 1 and let 
f(x) € Riz] be a nonzero polynomial. If f(a) = a9 + +--+ + an”, 
the degree of f(x) is n. (In particular, a polynomial of degree 0 is a 
nonzero constant polynomial.) © 


Remark 3.3.3. Note that the degree of the 0 polynomial is 
undefined. 7) 


Lemma 3.3.4. Let R be an integral domain. If f(a) and g(x) are 
polynomials in Rix] of degrees m and n respectively, then f(x)g(x) 
is a polynomial in Rix] of degree m+n. 


Proof. If f(x) =ag+--- +amzx™ with a, 4 0 and g(x) = bo + 
+++ + b,x”, with b, #0, then f(x) = agbo + ++: + Qmbnx™™” and 
Ambn #0 as R is an integral domain. 


Definition 3.3.5. A polynomial is monic if the coefficient of its high- 
order term is equal to 1, i.e., a polynomial f(x) = a9 + +--+ + an” 
of degree n is monic if a, = 1. © 


Theorem 3.3.6 (The division algorithm for polynomials). 
Let R be a commutative ring with 1. Let f(x) € Rix] be a polynomial 
and let g(x) € R[x] be a nonzero polynomial. Suppose that 


(1) g(x) is monic; or 
(2) R is a field. 


Then there are unique polynomials q(x) and r(x) € Ria] such 
that 


f(x) = g(x)q(x) + r(x) with r(x) = 0 or deg r(x) < deg g(x). 


Proof. We shall prove existence first and afterwards prove 
uniqueness. 

First of all, if f(z) = 0, then f(x) = g(x)q(x)+r(zx) with q(x) = 0 
and 7(z)= 0. 

Suppose that f(x) is nonzero. Let f(a) = a9 + --- +@mx™ have 
degree m and g(x) = bo + --- + bna” have degree n. In case (1) 
by, = 1, 
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First we handle the case n = 0, In case (1) we have f(x) = 
1f(x) +0 and in case (2) we have f(a) = bo(bp 'f(a)) +0 

Now suppose n > 0. We proceed by complete induction on m. If 
m<n, then f(x) = g(x)0+ f(x) and we let r(x) = f(z). 

Suppose the theorem is true for all polynomials of degree < m, 
and let f(x) have degree m. In case (1), let qi(x) = ama” and 
in case (2) let qi(x) = (amb; !)a™-”. Then f(x) and g(x)qi(x) are 
both polynomials of degree n with leading coefficient (the coefficient 
of the high-order term x”) equal to am. 

If f(x) = g(x)qi (x), let r(x) = 0, and we are done. 

Otherwise let f’(x) = f(x) — g(x)qi(x). Then f’(x) has degree 
less than m, so by the inductive hypothesis there are polynomials 
q(x) and r(x), with r(x) = 0 or deg r(x) < deg g(x), such that 
f' = g(x)d' + r(a). But then, if q(x) = u(x) + q'(2), 


f(x) = g(x)q(x) + r(x) with r(x)=0 or deg r(x) < deg g(x) 


Thus, the theorem is true if f(x) has degree m as well. 

Thus, by induction we are done. 

Now we must prove uniqueness. 

Suppose f(x) = g(x)qi(@) + 1ri(@) and f(x) = g(x)q2(x) + r2(x) 
with ri(z) = 0 or deg ri(x) < deg g(x) and with ro(x) = 0 or 
deg ra(a) < deg g(a). Then g(x)qi(x) + ri(a) = g(@)q2(@) + r(x) 
which yields the equation g(x)(qi(a) — qo(x)) = re(x) — ri(z). 

Now the right hand side is either 0 or a polynomial of degree < 
deg g(x). If qi(x) A g2(x), then qi (x) — q2(x) A 0 and, since g(x) is 
monic or R is a field, the product of the leading coefficients of g(x) 
and qi(#) — g2(x) is nonzero, so the left hand side is a polynomial 
of degree > deg g(x). But if so it is impossible for the left hand and 
right hand side to be equal, so this cannot be the case. Thus, we 
must have qi(%) = go(x), in which case r1(%) = ro(x) as well, and 
the expression for f(z) is unique. 


(You have surely noticed that the idea of the proof of the existence 
part of Theorem 3.3.6 was exactly the same as the idea of the proof 
of Theorem 3.3.1.) 


Corollary 3.3.7. Let R be a commutative ring with 1. Let f(x) € 
R{x] be a polynomial. Then for anya € R, f(x) = (x—a)q(x)+ f(a) 
for some polynomial q(x) € Rix]. In ager’ La es Fe) m 


Rix] if and only if a is a root of f(x), «e., of f(a) = 
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Proof. By Theorem 3.3.6, f(x) = (~—a)q(x) +c for some constant 
polynomial c, and then f(a) = (a—a)q(a) +c=c. 


Lemma 3.3.8. Let R be an integral domain. Let x — a divide the 
product h(x) = f(x)g(x) of the polynomials f(x), g(x) € R[x]. Then 
x —a divides f(x) or x —a divides g(x). 


Proof. If «—a divides h(x), then h(a) = 0. But h(a) = f(a)g(a). 
Thus f(a)g(a) = 0. Since R is an integral domain, (at least) one of 
the factors must be 0. So f(a) = 0, in which case (x — a) divides 
f(x), or g(a) = 0, in which case x — a divides g(x). 


Corollary 3.3.9. Let R be an integral domain. Let f(x) € R[x] be 
a nonzero polynomial. If f(x) has degree n, then f(x) has at most n 
roots in R. 


Proof. By induction on n. If n = 0, f(x) is a nonzero constant 
polynomial, so has no roots. 

Assume the corollary is true for all polynomials of degree n — 1, 
and let f(x) have degree n. If f(a) does not have a root, then we are 
done. Suppose f(x) has a root a. Then f(x) = (x — a)q(x) for some 
polynomial q(x) of degree n — 1. By the inductive hypothesis, q(x) 
has at most n — 1 roots, so f(z) has at most n — 1+ 1 =n roots. 

Then by induction we are done. 


These easy arguments, combined with the group theory we have 
done, have a powerful consequence. 


Theorem 3.3.10. Let R be an integral domain. Then any finite sub- 
group of R*, the group of units of R, is cyclic. 


Proof. Let G be a finite subgroup of R* and suppose that G has 
order n. Then g” — 1 for every g € G*, i.e., every g in G* is a root of 
the polynomial f(z) =z” —1€ R[z]. 

Now let d be any integer dividing n. If g € G has order dividing 
d, then by the same logic g4 = 1 so g is a root of the polynomial 
f(a) = 24-1 in R[x]. By Corollary 3.3.9, f(x) has at most d roots 
in R. Thus, there are at most d elements of R, and hence at most d 
elements of G, of order dividing d. 

Hence, by Corollary 2.2.6, G is cyclic. 
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Here is the most important special case of this theorem. 


Corollary 3.3.11. Let R be a finite field. Then R* = R— {0} is 
cyclic. In particular, for any prime p, Zy, is cyclic. 


Proof. If R is finite, R* is finite, so we may directly apply Theorem 
3.3.10. 


Remark 3.3.12. Note that in Lemma 3.3.8, Corollary 3.3.9, and 
Theorem 3.3.10 we had the hypothesis that R is an integral domain. 
This hypothesis is necessary and in general these results are false 
without it. 

For example, let R = Zg. Note that 9 = 1 (mod 8). Thus 27-9 = 
x? —1 in R[z]. Of course, x — 3 divides x? — 9 = (x — 3)(x + 3). Thus 
x — 3 divides x? —1 = (x—1)(x+1) but clearly x —3 does not divide 
either factor. Also, note that 12 = 3? = 5% = 7? = 1 in Zg, so the 
quadratic polynomial x? — 1 has the four roots 1, 3, 5 and 7 in Zg. 

Finally, note that Zg = 1,3,5,7 and this group is isomorphic to 
Z x Zo, not to the cyclic group Zy4. © 


We now prove one of the standard results of ring theory. 


Theorem 3.3.13 (Hilbert basis theorem). Let R be a commuta- 
tive Noetherian ring with 1. Then Ra] is a commutative Noetherian 
ring with 1. 


Proof. Let J be an ideal of R[z]. If J = {0} then J is certainly 
finitely generated, so suppose not. 

For a nonzero polynomial f(x) € Riz], let h(f(x)) be the coef- 
ficient of its high-order term, i.e., if f(z) = ag + --- + an", then 
h(f(x)) = an. Define ideals Ip CI, Clg C ... of R by 


In = {O} U {A(f(z)) | F(x) € Riz] 
is a polynomial of degree at most n}. 


(Note we have to include 0 separately as it cannot be a high-order 
coefficient.) It is easy to check that each I, is an ideal of R. 

Since R is Noetherian, this sequence is eventually constant. 

Choose a value N for which Iy = Iy41 = In +2 

For any value of n, let 


I) = 0u{h(f(z)) | f(x) € Riz] is a polynomial of degree exactly n}. 
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Certainly I/, C I,. We claim that in fact I/, = I,. To see this, 
suppose we have r € I;,, so that there is some polynomial f(x) € R{z] 
of degree k < n with h(f(z)) =r. Ifk=n, Thenre I Ifk <n, 
note that x”—* f(x) is a polynomial of degree n with h(x”~* f(x)) = 
h(f(x)) =r, so in this case, too, r € I). Thus Ij, C In, and so they 


are equal. 
Now R is Noetherian, so each ideal J, is finitely generated. For 
each n = 0,1,..., N let {fni(z), .--, fnk, } be a set of polynomials 


of degree exactly n that generate I,,. (We can choose the degree to 
be exactly n precisely because I!, = I;,.) We claim the set 


S= { foi(x), aay Fok (2); Fr), Samy Fie (2) 
fog NIE); Sey Fey 


generates I. Since this set is finite, J is finitely generated, and since 
I was an arbitrary ideal of R[x], this shows R[x] is Noetherian. 

Thus we must prove this claim. Let J be the ideal of R[x] gener- 
ated by S. Since every element of S is in J, we certainly have J C I. 
We need to show J C J. In other words, what we need to show is: 

Let f(x) € R[x] be a polynomial in J. Then f(x) € J. 

This is certainly true if f(x) = 0. Suppose f(a) is nonzero. We 
prove the claim by induction on n = deg(f(x)). Write f(z) = ao + 
coe of axa”. 

If n = 0, then f(x) = ao, a constant polynomial, and h(f(x)) = 
ag. Then {foi(x) = ao1, ---; foko(®) = Goro} generates Ij = Ip, so 
ao = 701401 + +++ + Tok o@oko for some elements 191, -.-, Tok, of R, 
ie, f(x) = ror foi(@) + +++ + Toko foxo(x), and f(x) € J. 

Now suppose the claim is true for all polynomials of degree < n, 
and suppose f(a) has degree n. There are two cases: 

Case 1: n < N. Then by the definition of {fni(z), ..-, fre, (x)}, 
these polynomials all have degree n, and if these polynomials have 
leading coefficients Gn1,..., Ank, We have Gyn = Tnidni + ++: + 
l'nkn nk,» {OY SOME Tn1, ---, Tnk, in R. In other words, the polynomi- 
als f(x) and g(x) = raifni(x) + +++ +1 nky fink, (x) are polynomials 
with the same degrees and with the same leading coefficient, so either 
f(x) = g(x) € J, or h(x) = f(x) — g(x) is a polynomial of degree 
<n, with h(x) € I. But then by the inductive hypothesis h(x) € J, 
and so f(x) = g(x) + h(x) € J as well. 
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Case 2: n > N. In this case [, = Iy. Then by the defini- 
tion of {fi (x), ..-, fey (x)} these polynomials all have degree N, 
and if these polynomials have leading coefficients ani, ..., @Nky, We 
have @n = Trn1@n1 + +: + TNkNG@Nky for some TrN1,.--,TNky in 
R. Now the polynomials f(x) and g(x) = ryia"—N fyi(a) + +++ + 
ike fky (x) are polynomials with the same degree and with 
the same leading coefficient, so, applying the some inductive argu- 
ment as in Case 1, we conclude f(x) € J in this case as well. 

Then by induction we are done. 


Corollary 3.3.14. 


(a) Let R be field. Then for any positive integer n, R[x, ..., @n] ts 
a Noetherian ring. 
(b) Let R= Z. Then for any positive integer n, R[x1,..., Ln] is a 


Noetherian ring. 


Proof. We saw in Example 3.2.15 that any field is a Noetherian 
ring, as is Z. Then this corollary follows from Theorem 3.3.13 by 
induction on the number of variables. 


3.4 Euclidean domains and principal ideal domains 


We have been assuming, and using, basic properties of the integers 
Z all along. Now we would like to prove them. Why have we not 
proved them before now? The reason is that we want to prove them 
in a more general context. The context is that of principal ideal 
domains (PIDs). 

That is, we want to show: 


(1) Zis a PID; and 
(2) PIDs have important properties. 


We can then conclude that Z has these properties. 

Of course, if Z were the only PID we would not have gained any- 
thing by this approach. But it is not. There are a variety of rings that 
are PIDs. So the advantage of our strategy is that we can prove these 
properties once and for all, and then they hold for all PIDs, rather 
than having to go back and reprove them everytime we encounter a 
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new PID. Not only would this be inefficient, it would be repetitious, 
as the basic ideas of the different proofs would be all the same. Even 
worse, we could never get a complete proof, as there might always be 
anew PID we hadn’t thought of. These considerations aside, when we 
examine proofs, we would like to distill out their essential ideas, and 
then see how to use them most effectively, and that is what we will 
be doing here. Historically speaking, these results were first proved 
for Z (actually, first proved in the context of the positive integers) 
and then, millennia later, generalized. With the benefit of hindsight, 
we will be proving these results in general and then obtaining them 
for Z, and certain other rings, as special cases. 

You'll notice I have been talking about step (2) here, and indeed 
talking very vaguely, as I have not even told you what these properties 
are. So I will have the ask you to believe me when I tell you that 
they are important while we carry out step (1). (First things first!) 
But I have said we do not just want to do this for Z, but for other 
rings R as well. Thus our step (1) will be 


(1) Show that certain rings are PIDs. 
But actually, we will be doing step (1) in two sub-steps 


(la) Show that certain rings are Euclidean domains. 
(la) Show that every Euclidean domain is a PID. 


The point of doing things this way is that we have a concrete 
numerical criterion for showing that a ring is a Euclidean domain, 
and hence a PID, and we will see some concrete applications of this 
criterion. 

Also, the reason some rings are called Euclidean domains is that 
we can perform Euclid’s algorithm in them. Euclid’s algorithm is an 
algorithm, which means we can use it to perform effective (and, it 
turns out, efficient) computations, which we will illustrate. 

I should point out that step (1b) says that every Euclidean domain 
is a PID, but does not say that every PID is a Euclidean domain. In 
fact, that is false — there are PIDs that are not Euclidean domains. 
But these do not play as important a role as Euclidean domains, so 
we will not concern ourselves with any of them. 

We now go to work. 
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Recall that N denotes the natural numbers, i.e., the positive 
integers. 
We will let N denote the nonnegative integers, N= NU {0}. (This is 
not standard notation.) 


Definition 3.4.1. Let R be an integral domain. A norm 6 on Risa 
function 6: R— {0} > N such that 6(a) < (ab) for all a,b € R—{O}. 
The norm 6 on R is a Euclidean norm if for any a € R with b ¥ 0, 
there are elements gq and r of R with 


a=bq+r where r =0 or d(r) < 4(b). 


If R has a Euclidean norm 46, then R is a Euclidean domain (with 
respect to 0). ) 


There is a kind of norm that is, as we shall see, particularly easy 
to work with. 


Definition 3.4.2. Let R be an integral domain. A norm, or a 
Euclidean norm, on R is multiplicative if 6(0) = 0, 6(1) = 1, and 
d(ab) = 6(a)d(b) for all a,b € R. .) 


Theorem 3.4.3. The function d(a) = |a| is a multiplicative 
Euclidean norm on Z, and hence Z is a Euclidean domain. 


Proof. This is Theorem 3.3.1. 


Theorem 3.4.4. Let R be a field. The function d(p(x)) = deg p(x) 
is a Euclidean norm on R[x], and hence, if R is a field, Rix] is a 
Euclidean domain. 


Proof. This is Theorem 3.3.6. 


We have the following general properties of norms. 
Lemma 3.4.5. Let R be an integral domain with norm 6. 


(a) For anyr € R, r £0, d(r) > d(1). 

(b) Ifu is a unit in R, d(u) = 6(1). 

(c) If R is a Euclidean domain with norm 6, andr € R with d(r) = 
6(1), then r is a unit in R. 

Proof. 


(a) O(7) =o(1r) = o(1). 
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(b) If wis a unit, let v € R with uv = 1. Then d(1) = d(uv) > d(u) 
and 6(u) > 6(1) by (a), so d(u) = (1). 

(c) Since R is a Euclidean domain, we have that 1 = rs +t for 
some s € R and some t € R with t = 0 or d(t) < d(r). But if 
6(r) = 6(1) there are no elements t of R with d(t) < d(r), so we 
must have t= 0, 1 =rs, and sor is a unit in R. 


Now we specialize our attention. 
Lemma 3.4.6. 


(a) Let D be a squarefree integer and let 6 be the function on QVD 
defined by 5(z) = |zz|. (Recall that if z = a+bVD, with a,b € Q, 
then Z = a—bV/D.) Then 6(z22) = 6(z)6(z2) for all 21,2 in 
Q(vD). 


(b) The function 6 restricts to a multiplicative norm on O(VD). 


Proof. Part (a) is direct calculation, which works just like calcula- 
tions in the complex numbers. (Note that 6(a+bVD) = |a? — b?D|.) 
Then to show part (b), we simply need to show that 6(z) € N when- 
ever z € O(VD). This is clear when D = 1 or 3 (mod 4), as then a 
and b are integers. We leave the more interesting case D = 1 (mod 
4) to the reader. 


It is certainly not the case that 6 is always, or even usually, a 
Euclidean norm on O(v D). In fact, this is the exception rather than 
the rule. But it is true in the following particularly important case. 


Theorem 3.4.7. The function 6 is a multiplicative Euclidean norm 
on O(./—1) = Zl]. Consequently the Gaussian integers Zli] are a 
Euclidean domain. 


Proof. Leta=w-+ai € Zii] andb=y+2i € Zi], b #0. Then w, 
x,y, and z are integers with not both y and z equal to 0. 

We have shown that Q(i) is a field (Example 3.1.16), so we divide 
a by 6 in Q(z). Then 


a wt whet yom _ wyt+e2+(—we+ay)t ela 
b ytai ytai yz y2 + 22 
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where 


js ew and v = 
yr rz 


—wz + ry 


EQ. 
y? + 2? Q 


If (we are very lucky and) u € Z,v € Z, set q = u+ vi. Then 
q € Zi] and a/b = q, ie., a = bg = bg +0 so set r = 0 and we are 
done. 

Suppose not. Choose integers m and n with |u—m| < 1/2, |u—n| < 
1/2, and let g= m-+ni. Then 


a = 0(a/b) = bu + vi) = ((m + (u—m)) + (n+ (v —n))i) 
= b(m + ni) + b((u — m) + (uv — u)t) 
=bq+r 
where 
r= b((u—m) + (v—n)i). 


First let us observe that, since Z[i] is a ring, r = a — bq € Zi]. 
The key thing we have to show is that d(r) < 6(b). To this end, 
let 


s=(u-—m) + (v—n)i € Q(t). 


Then 6(s) = (u—m)* + (v—n)? < (1/2)? + (1/2)? = 1/2 and so, 
since 6 is multiplicative on Q(%), 


5(r) = 6(bs) = 5(b)6(s) < (1/2)5(b) < 5(b) 


as required. 


Remark 3.4.8. Observe that for any D, if z € O(WD) with 6(z) = 
1, then z is a unit: The equation 6(z) = 1 is the equation |zz| = 1, 
1.6.22 = 1, so z(+2) = land z is a unit. 


Now we come to principal ideal domains. We begin with the 
definition. 


Definition 3.4.9. Let R be a commutative ring with 1. An ideal I 
of R is principal if it is generated by a single element ro of R, or, 
equivalently, if J = {rro | r € R}. 

An integral domain R is a principal ideal domain (PID) if every 
ideal in R is principal. © 
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Remark 3.4.10. If R is a field, then R is a PID, as R only has two 
ideals, {O}, generated by 0 € R, and R, generated by 1 € R. As 
we will see, we are mostly interested in PIDs because of questions 
of divisibility, and these questions are completely uninteresting for 
fields, as every nonzero element of a field is a unit. © 


Here is our next sub-step. 
Theorem 3.4.11. Let R be a Euclidean domain. Then R is a PID. 


Proof. We have to show that every ideal J of R is principal. 
If J = {0}, then J is generated by the single element 0. 
Suppose that {J 4 {0}}. Consider the following set S: 


S={6(r) |reie 40} 


Note that S is a nonempty subset of N. Thus, S$ has a smallest 
element so. Let ro € I with (ro) = so. We claim that the single 
element rp generates J, or, in other words, that I = IJ,, = {rro | r € 
R}. Certainly, I, C Io, as I is an ideal, so we need to show I C I,,. 
To this end, let r be an arbitrary element of J. Since R is a Euclidean 
domain, we know that 


r =roq+t for some g € R and some t € R with t = 0 or d(t) < d(r0). 


Now IJ is an ideal, r € J andro € J, sot=r—roqg El. 
If t were not zero, it would be an element of J with d(t) < (ro) = 
s9, which is impossible, as sp is the smallest norm of any nonzero 
element of J. Hence, t = 0, r = roq, so r € Ip, as required. 


Corollary 3.4.12. The following rings are PIDs: 


(a) Z 
(b) Rix] for any field R 
(c) Zfi] 


Proof. 


(a) Immediate from Theorem 3.4.3 and Theorem 3.4.11. 
(b) Immediate from Theorem 3.4.4 and Theorem 3.4.11. 
(c) Immediate from Theorem 3.4.7 and Theorem 3.4.11. 
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3.5 Integral domains and divisibility 


One of the main reasons we are interested in PIDs is in connection 
with questions of divisibility and factorization. But we will start by 
discussing questions of divisibility in integral domains in general. 


Definition 3.5.1. Let R be an integral domain. Let a and 6b be 
elements of R with b 4 0. Then a is divisible by b, or is a multiple of 
b, or b divides a, or is a factor of a, if there is an element qg of R with 
a = bq. We write this as bla. o) 


Definition 3.5.2. Let R be an integral domain, and let a be a 
nonzero element of R. 


(a) ais a unit of R if a divides 1. 

(b) a is irreducible in R if a is not a unit of R and whenever a = bc, 
b,c € R, then 6 is a unit or c is a unit. 

(c) ais prime in R if a is not a unit and whenever a divides a product 
bc, b,c € R, then a divides b or a divides c. ‘. 


Remark 3.5.3. You may be a bit surprised by Definition 3.5.2. You 
are undoubtedly familiar with (and, indeed, we have used many times 
in this book so far) the notion of a prime in Z, and that looks very 
much like the definition of an irreducible element in a general integral 
domain rather than that of a prime. But it turns out that the correct 
generalization of the notion of a prime in Z is a prime in R as we 
have stated it. v) 


The notions of prime and irreducible in a general integral domain 
are distinct, but there is a close relationship between them. 


Lemma 3.5.4. Let R be an integral domain, and let a be prime in 
R. Then a is irreducible in R. 


Proof. Suppose that a is prime in R. Let a = bc. We have to show 
that b is a unit or c is a unit. Now if a = bc, then certainly a divides 
bc (bc = a = al). Since a is a prime, a divides 6 or a divides c. 
Suppose that a divides b, and write b = ab’. Then 


al = a= bc = (ab’)c = a(b'c) 


so by cancellation in an integral domain (Lemma 3.1.14) 1 = b’c and 
cis a unit. Similarly, if a divides c then 0 is a unit. 
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Remark 3.5.5. As we shall see, it is not always true that an irre- 
ducible element of an integral domain R is prime, although in many 
important cases, including the case of R a PID, it is. © 


Definition 3.5.6. Let R be an integral domain and let a and b 
be nonzero elements of R. Then a and b are associates if a divides 
b and b divides a. We write this as a = b. (This is not standard 
notation.) 7) 


Lemma 3.5.7. Let R be an integral domain. 


(a) a =b is an equivalence relation on R— {0} 
(b) a=) tf and only if a = bu for some unit u, or b = av for some 
unit v 


Proof. We leave this as an exercise for the reader. 


Remark 3.5.8. Note that if b divides a and 0b’ is an associate of b, 
then b’ divides a as well. If b! = bu for a unit u of R, and uv = 1, 
then if a = bq, a = blq = b(uv)q = (bu)(vqg) = Ud for qd! = vg. © 


Definition 3.5.9. Let {a;} bea set of elements in an integral domain 
R. Then g is a greatest common divisor (gcd) of {a;} if 


(1) g divides a; for each 7; and 
(2) if dis any element of R that divides each a;, then d divides g. > 


Remark 3.5.10. How do we know that {a;} has a gcd? That is, 
how do we know that there is an element g of R satisfying these 
two conditions? The answer is, we don’t. As we shall see, there are 
integral domains R in which gcd’s do not in general exist. But as we 
shall also see, there are many cases in which they do. In particular, 
we shall see that if R is a PID, gcd’s always exist. Not only that, we 
will see that they have a stronger property. © 


Remark 3.5.11. Again you may be puzzled by the term gcd. What 
does “greatest” mean? In fact, looking at the definition, a gcd should 
really be called a most divisible common divisor, rather than a great- 
est one. We use this term for historical reasons. The notion of a gcd 
goes back to Euclid, who considered the positive integers, and there 
the gcd of a and b was indeed the greatest (i.e., largest) common 
divisor of a and b. But when mathematicians generalized this notion 
to integral domains, we kept the term. © 
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Although we don’t know that a gcd exists, let us proceed for the 
moment and assume that it does, and see what we can say about it. 


Lemma 3.5.12. Suppose that {a;} has a gcd g. Then g' is a gcd of 
{a;} if and only if g' and g are associates. Thus, if g is a gcd of {aj}, 
all gcd’s of {a;} are given by g' = gu for some unit u of R. 


Proof. By property (2) of a gcd, g divides g/ and g’ divides 
g, so they are associates. Then the conclusion follows from 
Remark 3.5.8. 


Remark 3.5.13. From Lemma 3.5.12 we see that it is improper to 
speak of the gcd of {a;}; rather we need to speak of a gcd of {a;}. We 
will use the (nonstandard) notation g = gcd({a;}) in this case. This 
is related to, but not the same as, the notation a = 6 in Definition 
3.5.5, but we use the same notation because these are closely related. 
Similarly, we will use the notation gcd({a;}) = gced({bj}) to mean 
that any gcd of {a;} is a gcd of {b;}; again this is another use of this 
notation, but again we continue to use it because of the close relation 
of these ideas. (Strictly speaking, we should write gcd({a;}) for the 
set of all gcd’s of {a;} and then write g € gcd({a;}) to mean that g 
is a gcd of {a;}, But this notation is clumsy and unintuitive. Nobody 
uses it, and we won’t either.) ?) 


Remark 3.5.14. Observe that, in our language, the principal ideal 
{rro | r € R} in a commutative ring with 1 generated by the element 
ro consists precisely of the multiples of ro in R. © 


Theorem 3.5.15. Let R be a principal ideal domain (PID). Let {a;} 
be a set of elements of R, not all zero. Then {a;} has a gcd g. Fur- 
thermore, g is a generator of the ideal I generated by {a;}, and so g 
can be written as 


g= Se abi for some {bj} CR, 
a finite sum (i.e., all but finitely many b; = 0). 


Proof. Let I be the ideal of R generated by {a;}. Since R is a PID, 
TI is principal, so is generated by a single element g. We claim that g 
is a gcd of {a;}. 

First of all, property (1) is true as a; € I for each i, and I consists 
precisely of the multiples of g (Remark 3.5.14). 
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Now note that, since g € I, we can write g as 
g= os ajb; forsome {b;} € R, 


a finite sum (Definition 3.2.13). 
Suppose that d divides each a;, so that a; = da’, for each i. Then 


9 = > aibi = S“(daj)b; = d O32 abi) 


so d divides g, and property (2) is true as well. 


Corollary 3.5.16. In any one of the following rings: 


(a) Z 
(b) Rix] for any field R 
(c) Zl] 


the conclusion of Theorem 3.5.15 holds. 


Proof. Immediate from Corollary 3.4.12. 


In order to aid us in our theoretical development, let. us define a 
new object. 


Definition 3.5.17. An integral domain R is a GCD domain (resp. 
an f-GCD domain if every set (resp. every finite set) {a‘} of elements 
of R, not all zero, has a gcd. 


Remark 3.5.18. We observe that every GCD domain is an f- 
GCD domain, and that every Noetherian f-GCD domain is a GCD 
domain. © 


Remark 3.5.19. We have been focusing on, and will continue for 
a while to focus on, theoretical properties of gcd’s. But you may 
well ask, how do we in practice go about finding them. We will 
later see a very effective method, Euclid’s algorithm, for computing 
gcd({a1,a2}) for a pair of nonzero elements a, and a2 of a Euclidean 
domain R. © 


We record a couple of results now that will be very useful in our 
computations later. 


Lemma 3.5.20. Let R be an f-GCD domain. 
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(a) Letace R, a0. Then a= gcd({a}). 

(b) Letbe R,b #0. Ifa is any element of R such that b divides a, 
then b & gcd({a, b}). In particular, b = gcd({b,0}). 

(c) Letbe R,b #0. Leta € R and let q andr be any elements of 
R witha =bq+r. Then gcd({a, b}) = gcd({b, r}). 


Proof. (a) is immediate and (b) is almost immediate as every ele- 
ment of R divides 0. To show (c), we will show that a and b, and b 
and r, have exactly the same common divisors. Then they will cer- 
tainly have the same gcd. To that end, suppose that d divides both 
a and b. Then d will divide b (by assumption) and r = a — bq. On 
the other hand, suppose d divides both b and r. Then d will divide b 
(by assumption) and a = bq +r. 


Lemma 3.5.21. Let R be a GCD domain (resp. an f-GCD domain) 
and let B and C be sets (resp. finite sets) of elements of R, not all 
zero. Let A= BUC. Then gcd(A) = gcd({gcd(B), gcd(C)}). 


Proof. Let g4 = gced(A), gp = gced(B), go = ged(C), and g = 
gecd({gp,gc}). We want to show that g4 & g. 

By definition, g4 divides every element of A, so it divides every 
element of B, in which case it divides gp, and it divides every element 
of C, in which case it divides gc. Thus ga is a common divisor of gp 
and gc, so gq divides g = gcd({gp, gc}). 

On the other hand, by definition g divides both gp and gc. Since 
g divides gp, it divides every element of B, and g divides gc, so g 
divides every element of C. Thus g divides every element of A = 
BUC, so g divides ga. 


Henceforth we will simplify our notation (as is standard) and write 
g = gcd(aj,a2) rather than g = gcd({ai,a2}), g = gcd(a1, a2, a3) 
rather than g = gcd({a1, a2, a3}), ete. 


Remark 3.5.22. Lemma 3.5.21 has a very practical application. 
Suppose we can compute the gcd of any two elements of R (see 
Remark 3.5.19). Then we can inductively compute the gcd of any 
finite number of elements of R: 


gcd(a1,a2,a3) = ged(ay, gcd(ag, a3)), 


gcd(a1, a2, a3, a4) = gcd(aj, gcd(az,a3,a4)), ete. 0) 
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We now define a property that will, as we will see, play a key role. 
Definition 3.5.23. 


(a) Let R be a GCD domain (resp. an f-GCD domain). A set (resp. 
a finite set) {a;} of elements of R is relatively prime if 1 = 
gcd ({a; }). 

(b) Let R be an f-GCD domain. A set {a;} of elements of R is 
pairwise relatively prime if 1 = gcd(a;,a;) whenever i ¥ j. © 


To see the difference between these two notions, observe that 
{6, 10,15} is a relatively prime subset of Z but is not pairwise rela- 
tively prime. 


Lemma 3.5.24. Let R be a GCD-domain (resp. an f-GCD domain) 
and let {a;} be a set (resp. a finite set) of elements of R, not all zero. 
Let g = gcd({a;}). 


(a) Let d be a common divisor of {a;}, (i.e., d is a divisor of each 
a;). Then g/d = gcd({a;/d}). In particular, {a;/d} is relatively 
prime if and only if d = g. 

(b) Let m be any nonzero element of R. Then gm = gcd({ma;}). 


Proof. For simplicity, we will assume that our set just consists of 
two elements of R, which we write as {a,b}. 


(a) Let a = da’ and b = db’. Since d is a common divisor of a and 
b, d divides g. Write g = dg’. We claim that g’ © gcd(a’,b’). To 
show this we must show / satisfies both properties of a gcd: 


(1) We have that g divides a, i.e., that dg! divides da’, and hence 
by cancellation that g’ divides a’; similarly g/ divides b’. 

(2) Suppose that h’ is a common divisor of a’ and b’. Then h’ 
divides a’ so dh’ divides da’ = a; similarly dh’ divides b. Thus 
dh’ is a common divisor of a and b, so dh’ divides g = dq’. 
Then, by cancellation, h’ divides d’. 


(b) This follows from part (a) by changing our point of view. Let 
a@ = ma, b= mb, and g = gcd(a,b). Then a = G/m, b = b/m so 
by part (a), if g = gcd(a,b), then g/m & g, ie., g = mg. 


Now we come to a result that plays a key role. 
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Lemma 3.5.25 (Euclid’s lemma). Let R be an f-GCD domain. 
Let a be any nonzero element of R. Let b and c be elements of R 
and suppose that a divides bc. If a and 6 are relatively prime, then a 
divides c. 


Proof. Let d = gcd(ac, bc). Since c divides both ac and bc, c divides 
d. Write d = ce. Now d divides bc, i.e., ce divides bc, so e divides 6. 
Also, d divides ac, i.e., ce divides ac, so e divides a. Thus e divides 
gcd(a,b). But a and 6 are assumed to be relatively prime, i.e., 1 = 
gcd(a, b), so e divides 1, i.e., e is a unit, and soe = 1, andd=c. 
Now a certainly divides ac, and a divides bc by hypothesis, so a 
divides d, and hence, since d = c, a divides c. 


Remark 3.5.26. This proof was short, though a bit tricky. We will 
see that in the case of a PID, Euclid’s lemma has an even shorter 
and more straightforward proof. © 


Remark 3.5.27. Euclid’s lemma is false in general without the 
assumption that a and 6 are relatively prime. For example, in Z, 
6 divides 210 = 10- 21 without dividing either factor. © 


Here are two important consequences of Euclid’s lemma. 


Corollary 3.5.28. Let R be an f-GCD domain and let a and b be 
nonzero elements of R. Let c be an element of R and suppose that 
a divides c and b divides c. If a and b are relatively prime, then ab 
divides c. 


Proof. Since a divides c, we may write c = ad for some element 
d of R. Then b divides ad, and b and a are relatively prime, so, by 
Euclid’s lemma, b divides d. Write d = be for some element e of R. 
Then 


c = ad = a(be) = (ab)e 


and so ab divides c. 


Remark 3.5.29. This corollary is false in general without the 
assumption that a and 0 are relatively prime. For example, 6 divides 
30 and 10 divides 30, but 6-10 = 60 does not divide 30. © 


Corollary 3.5.30. Let R be an f-GCD domain and let a,b, and c 
be elements of R. If a and b are relatively prime, and a and c are 
relatively prime, then a and be are relatively prime. 
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Proof. Let d = gcd(a,bc). Let e ~ gcd(d,b). Since d divides a, e 
divides gcd(a,b) = 1. Thus e = 1, i.e., d and b are relatively prime. 
Now d divides bc, and d and 6 are relatively prime, so, by Euclid’s 
lemma, d divides c. Thus d is a common divisor of a and c. But 
a and c are relatively prime, so d & 1, i.e., a and bc are relatively 
prime. 


Here is a third, particularly important, consequence of Euclid’s 
lemma. 


Corollary 3.5.31. Let R be an f-GCD domain and let a be an ele- 
ment of R. Then a is prime if and only if a is irreducible. 


Proof. We already know, by Lemma 3.5.4, that in any integral 
domain, every prime is irreducible. So we must show that if R is an 
f-GCD domain, every irreducible is prime. 

Let a € R be irreducible, and let d be a divisor of a. By the 
definition of an irreducible element, there are only two (mutually 
exclusive) possibilities: d= 1 or d = a. Now suppose that a divides 
bc. We must show that a divides b or a divides c. If a divides b, we 
are done. Suppose not. Then, since d = gcd({a,b}) is a divisor of 
a, we must have d = 1, i.e., a and 0 are relatively prime. Then, by 
Euclid’s lemma, a divides c. 


Remark 3.5.32. We want to point out an important detail about 
the conclusion of Lemma 3.5.24. Let R be an f-GCD domain and let 
a,b € R, not both zero. Let g = gcd(a,b). Write a = ga’, b = gb’. 
Then Lemma 3.5.24 (a) tells us that a’ and Ob’ are relatively prime. 
But it is not necessarily the case that a’ and g are relatively prime, 
or that b’ and g are relatively prime. Here is an example to illustrate 
this. Let R = Z, let a = 12, and let b= 18. Then g = 6, anda = 6-2, 
b = 6-3. Sure enough, 2 and 3 are relatively prime, but neither 2 
and 6, nor 3 and 6, are relatively prime. © 


3.6 Principal ideal domains and unique factorization 
domains 


In this section we focus on principal ideal domains (PIDs) and com- 
plete step (2) of our program, to show that 


(2) PIDs have important properties. 
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We can break step (2) into sub-steps, to show that 


(2a) PIDs have a variety of important properties, and 
(2b) PIDs have a particularly important property, that 
of “unique factorization” (which we have yet to 
define). 
Actually, we have essentially already accomplished the first sub- 
step. But as these properties were derived in various different points 
in our development, for both convenience and clarity we will collect 


them here. 
First we recall the basic definition, Definition 3.4.9. 


Definition 3.6.1. Let R be a commutative ring with 1. An ideal I 
of R is principal if it is generated by a single element ro of R, or, 
equivalently, if J = {rro | r € R}. 

An integral domain R is a principal ideal domain (PID) if every 
ideal in R is principal. © 


Theorem 3.6.2. Every Euclidean domain is a PID. 
Proof. This is Theorem 3.4.11. 


Corollary 3.6.3. The following rings are PIDs: Z, R[x] for R a 
field, and Z{i]. 


Proof. This is Corollary 3.4.12. 


Theorem 3.6.4. Let R be a PID. Let {a;} be a set of elements of 
R, not all zero. Then {a;} has a gcd g. Furthermore, g is a generator 
of the ideal I generated by {a;}, and so g can be written as 


g= Se aibj for some {bj} CR, 
a finite sum (i.e., all but finitely many b; = 0). 
Proof. This is Theorem 3.5.15. 


In the language of Definition 3.5.17, the first sentence of the con- 
clusion says that every PID is a GCD domain. The second sentence 
of the conclusion (beginning “Furthermore” ) is a stronger property 
of PIDs, not shared by all GCD domains. 

We have Euclid’s lemma, Lemma 3.5.25, but we remarked there 
when we proved it that it has an easier and more straightforward 
proof in the case of a PID. We restate it and give that proof now. 
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Lemma 3.6.5 (Euclid’s lemma). Let R be a PID. Let a be any 
nonzero element of R. Let b and c be elements of R and suppose that 
a divides bc. If a and b are relatively prime, then a divides c. 


Proof. We are given that a and b are relatively prime, i.e., that 
~ gcd(a, b) (Definition 3.5.23). Then, by Theorem 3.6.4, 


l=ar+bs forsome r,sER 
and then 
c= c(ar + bs) = a(br) + (bc)s. 


Now a visibly divides the first term on the right-hand side, and by 
hypothesis a divides bc, so a divides the second term as well. Hence, 
a divides their sum, which is c, as claimed. 


Then we have several consequences of Euclid’s lemma. 


Corollary 3.6.6. Let R be a PID. Let a and b be nonzero elements 
of R. Let c be an element of R and suppose that a divides c and b 
divides c. If a and 6 are relatively prime, then ab divides c. 


Proof. This is Corollary 3.5.28. 


Corollary 3.6.7. Let R be a PID. Let a,b, and c be elements of R. 
Ifa and b are relatively prime, and a and c are relatively prime, then 
a and be are relatively prime. 


Proof. This is Corollary 3.5.30. 


Corollary 3.6.8. Let R be a PID. Let a be an element of R. Then 
a is prime if and only if a is irreducible. 


Proof. This is Corollary 3.5.31. 


Now we have a look at the Chinese remainder theorem for PIDs. 


Lemma 3.6.9. Let R be a PID. Let I and J be ideals in R, generated 
by elements a and b respectively. 


(a) The ideals I and J of R are coprime if and only if the elements 
a and 6 are relatively prime. 

(b) If a and b are relatively prime then IJ = IM J is the ideal 
generated by ab. 
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Proof. 


(a) By definition (Definition 3.2.23), I and J are coprime if [+ J = 
R. Then this conclusion follows immediately from Theorem 3.6.4 
and Lemma. 3.2.6. 

(b) This follows directly from Corollary 3.6.6. 


Let I be an ideal of the PID R, generated by an element 19. We 
adopt the standard notation a = b (mod ro) for a = b (mod I). 


Theorem 3.6.10 (Chinese remainder theorem). Let R be a 
PID. Let {am, ..., An} be a set of pairwise relatively prime elements 
of R. Let ry, ..., Tr be arbitrary elements of R. Then the system of 
simultaneous congruences 


x =r, (mod aj) 


x =rg (mod az) 


L=Tn (mod ap) 
has a solution x = b in R, and x = 0! is a solution if and only if 


b’ = b (mod ay --: Gp). 


Proof. This is Theorem 3.2.26, stated in our language, and using 
Lemma 3.6.9. 


The general results we have stated so far have evidently been 
closely related to questions of divisibility. Here is one more general 
result that. we have already proved. On the face of it, it doesn’t seem 
to have much to do with divisibility. But appearances are deceiving, 
and we will soon see the essential role it plays. 


Theorem 3.6.11. Let R be a PID. Then R is a Noetherian ring. 


Proof. Immediate from Theorem 3.2.14. 


Now we came to our main result, “unique factorization”. But let 
us see what we should mean by that. Thinking about the positive 
integers, we know we can factor 6 are 6 = 2-3, a product of primes. 
Now this factorization is, strictly speaking, not unique, as we also 
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have 6 = 3-2. Thus the first thing we see is that we wish to consider 
two factorizations to be “essentially” the same if they only differ in 
the order of the factors. 

But now let us think about factorization in Z. Here we have 


We see that what we have done here is simply spread unit factors 
(recall the units in Z are {+1}) around in such a way that they cancel, 
and we want to consider all of these factorizations to be “essentially” 
the same as well. Recall that two nonzero elements that differ by a 
unit factor are associates (so here 2 and —2 are associates, as are 3 
and —3). 

With these considerations in mind we can formulate what it means 
for factorizations to be “essentially” unique. 

But we have one more consideration before we do so. We can 
always group terms together, so that we can write, for example, 4 = 
2-2 = 2? if we wish, and it is often convenient to do so. Again, we 
also have 4 = (—2)?. Now of course 2 4 —2, but once again they 
are associates. Thus what we will mean in our second formulation by 
primes being “distinct” is not just that they are not the same, but 
also not associates of each other. 

It is illuminating to state the property of “unique factorization” 
separately, and then to formulate our main result as saying that PIDs 
have this property. 


Definition 3.6.12. An integral domain R is a unique factorization 
domain (UF'D) if every nonzero element a of R can be written as 


Gd=UPp, ... Dp 
for some unit u and primes pj, ..., p,, and if also 
G=vq1 .-- ds 


for some unit v and primes qi, ..., ds, then r = s and after possible 
reordering, p; and q; are associates fori = 1, ..., r. Equivalently, R 
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is a UFD if every nonzero element a of R can be written as 
Oe) tan Bo 


where u is a unit, pj, ..., Dm are distinct primes, and e1, ..., €m are 
positive integers, and if also 


a = vq!" gin 
where v is a unit, q1, ..., Gn are distinct primes, and fi, ..., fn are 
positive integers, then n = m and after possible reordering, p; and q; 
are associates and f; = e; fori =1,..., m. © 


Theorem 3.6.13. Let R be a PID. Then R is a UFD. 


Proof. First we show that every nonzero element a and R has 
a factorization as in Definition 3.6.12, and then we show that the 
factorization of a is essentially unique. 

The first step has two substeps. Then first substep is to prove the 
following claim: 


Claim. Every nonzero element a of R that is not a unit is divisible 
by some irreducible element p of R. 


Proof of claim. If a is irreducible, set p = a and we are done. If a 
is not irreducible, write a = a,b; with neither a; nor 6, units. If a, is 
irreducible, set p = a; and we are done. If a, is not irreducible, write 
a, = a2b2 with neither a2 nor b2 units (so that a = a,b, = a2(b2b1)). 
If ag is irreducible, set p = ag and we are done. If ag is not irreducible, 
write ag = a3b3 with neither a3 nor b3 units. If ag is irreducible, set 
p = az and we are done. If not, continue. 

We must show this process stops at some stage, say stage n, in 
which case p = Gy is irreducible. Suppose not. Let J, be the principal 
ideal generated by aj, Jo the principal ideal generated by ag, [3 the 
principal ideal generated by a3, ... . Then, since by, b2, b3, ... are not 
units, we have an infinite chain 


ClpaCcIgcC-:-:- 


which is impossible, as a PID is a Noetherian ring (Theorem 3.6.11). 
The second substep is to prove the following claim: 
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Claim. Every nonzero element a of R can be written as 
A= Up, ... Dr 
with u a unit and p,, ..., p, irreducible. 


Proof of claim. If a is a unit, set w= a and we are done. 

If a is not a unit, then a is divisible by some irreducible element 
p; of R. Write a = piq1. If qi is a unit, set u = q, and we are done. 
If not, gq, is divisible by some irreducible element p2 of R. Write 
Pi = p2qg2 (so that a = (pip2)q2). If qo is a unit, set wu = qo and we 
are done. If not, once again, continue. 

Again we must show that this process steps at some stage, say 
stage r, in which case, setting u = q,, a = up, ... pr. Again, suppose 
not. 

Let J, be the principal ideal generated by pj, [> the principal ideal 
generated by po, .... Then, since q1,q2,... are not units, we have 
an infinite chain 


K,Clhc... 


which is again impossible as R is a Noetherian ring. 

Thus, we have finished step 1. 

Now for step 2, essential uniqueness. Suppose we have two factor- 
izations as in Definition 3.6.12. Since R is a PID, every irreducible 
is a prime (Corollary 3.6.8) so we have two factorizations of a into 
primes 


a= Up, ... pr =Uq1 .-- ds- 


Now pp, visibly divides a, so it divides the right-hand product. 
By the definition of a prime, it must divide one of the factors. It 
certainly doesn’t divide the unit factor, so it must divide one of the 
other factors. By reordering, if necessary, we may assume it divides 
qi. But q, is irreducible, so we must have q; = p,v’ for some unit v’, 
in which case p; and q; are associates. Thus, 


a= Uupy ... Pr = U(v'pi)g «+, 9 
so by cancellation 
a’ = upe ... pp = (vv’')qo ... Qa. 


Again we apply the same argument: p divides a’, so must divide 
some term on the right-hand side, which, after possible reordering, 
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we may assume to be qg, in which case qg = pov” for some unit vu”, 
and p2 and qe are associates. Thus, 


td 


a’ = up2 ... pp = (vu'v")pog3 ... As 
so by cancellation 


a” = up3 ... pp = (vv'v")q3 ... Qs 


and continue, matching up the factors 1 by 1, until we are done. 


We conclude this section by investigating properties of UFDs. As 
we will see later (Remark 3.11.10) there are important examples of 
rings that are UFDs but not PIDs. 


Lemma 3.6.15. Let R be a UFD. Let p be an element of R. Then 
p is prime if and only if p is irreducible. 


Proof. Recall from Lemma 3.5.4 that if p is prime then p is irre- 
ducible. So we must show that if p is irreducible then p is prime. So 
suppose that p is irreducible and that p divides a = bc. We must 
show that p divides 6 or p divides c. Since p divides a, a certainly 
has a factorization into irreducibles 


a= Upp2 .-- Pi- 


Suppose p does not divide b. Then 6 has a factorization into 
irreducibles 


b= vq s+ Qj, 


with none of the q’s an associate of p. Similarly, if p does not divide 
c then c has a factorization into irreducibles 


C= WwW, «++ Tr; 


with none of the r’s an associate of p. 
Now a = beso we see that a has the factorizations into irreducibles 


= Uppa 20. Py = (UW) G1 os GFP one TR 


But then a has two distinct factorizations into irreducibles (these 
two factorizations being distinct as p appears in the first but not in 
the second); contradiction. 


Ring Theory 199 


Lemma 3.6.16. Let R be a UFD and let a and b be nonzero elements 
of R. Let a and b have factorizations into powers of distinct primes 


A jue ei ft fj 
Q= up; ... DQ sed 
a ni Gl 95h h 
b= vq; ody ty. ee 


Then, 4) dy = MM. G5 )s nog ye = I); 


qi... qd; = gcd(a, b). 


Proof. We leave this as an exercise for the reader. 


Lemma 3.6.17. Let R be a UFD. Then R is a GCD domain. 


Proof. We also leave this as an exercise for the reader. 


Remark 3.6.18. As we have observed, the gcd is only defined up 
to multiplication by a unit. In the case of Z, the units are +1, so the 
gcd is only defined up to sign. Here we make the convention that we 
always choose the plus sign, so that the gcd is a positive integer. In 
the case of Riz], R a field, the units are the nonzero elements of R. 
Here we make the convention that the gcd is a monic polynomial. 
For a general GCD domain, there is no preferred choice, with one 
exception: If elements are relatively prime, we make the convention 
that their gcd is 1. But we stress that these are matters of convention, 
not of necessity. © 


Example 3.6.19. We set R = Z and note that R is a Euclidean 
domain, and hence a PID. 


(a) We see immediately that 2 and 3 are relatively prime, i.e., 1 ~ 
gcd(2,3), and almost as quickly that 1 = 2(—1) + 3(1). 

(b) We see immediately that 10 = 2-5 and 33 = 3-11 are relatively 
prime, i.e., 1 = gcd(10,33), but it takes a little work to see that 
1 = 10(10) + 33(—3). 

(c) We see that 360 = 23 - 32-5 and 700 = 2? - 52-7, and so, by 
Lemma 3.6.16, 20 = 27-5 & gcd(360, 700). Then we can see that 
20 = 360(2) + 700(—1). 

(d) Suppose we want to find gcd(161, 1001). It takes some work to 
factor these. When we do, we find 161 = 7-23, 1001 = 7-11-18, 
so 7 = gcd(161, 1001). Then also 7 = 161(56) + 1001(—9), but it 
does not seem at all easy to find this expression. 
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(e) To find ged(2501, 4551) we must factor these numbers, which 
is not at all easy. But it turns out that 2501 = 41. 61 and 
4551 = 41-111, so 41 © ged(2501, 4551). Then also 41 = 
2501(—20) + 4551(11) and again it does not seem at all easy 
to find this expression. 

(f) How about ged (12345, 54321)? It turns out that 3 & 
gcd(12345, 54321) and 3 © 12345(3617) + 54321(—822). 

(g) How about gcd(124816, 618421)? It turns out that these two 
numbers are relatively prime, i.e., 1 = gcd(124816, 618421), and 
1 = 124816(—266427) + 618421(53773). 


Remark 3.6.20. Just proceeding as we have so far, to find g = 
gcd(a,b) we would have to factor a and 6, and this quickly becomes 
impractical as a and 6 get large. Even if we accomplish this first step, 
this doesn’t at all help us in the more difficult second step of finding 
x and y with g = ax + by. But in the next section we will not only 
see how to easily find g without having to factor a and b, and almost 
as easily how to find such integers x and y. © 


We can now see the essential role that the gcd plays in unique 
factorization. 


Definition 3.6.21. Let R bearing. Then R is principally Noetherian 


if every sequence of principal ideals I, C Ig C [3 C ... is eventually 
constant, or, otherwise said, if every sequence of principal ideals , C 
Ig C Ig C ... is finite. v) 


Theorem 3.6.22. Let R be an integral domain. Then R is a UFD 
if and only if R is a principally Noetherian f-GCD domain. 


Proof. Suppose that R is a principally Noetherian f-GCD domain. 
Consider the proof of Theorem 3.6.13, that every PID is a UFD. 
Step 1 of the proof goes through unchanged as we are assuming R is 
principally Noetherian. Step 2 of the proof goes through unchanged 
as we are assuming that R is an f-GCD domain, and hence, by 
Corollary 3.5.31, every irreducible element of R is prime. 
Conversely, if R is a UFD it is easy to check that R is principally 
Noetherian, and we have already observed that R is a GCD domain 
(and hence an f-GCD domain) in Lemma 3.6.17. 
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Remark 3.6.23. We close this section by observing that at this point 
we can breathe a huge sign of relief. We started out by assuming 
properties of the integers that we stated in Appendix A, and used 
these properties in our earlier development. But now that we have 
shown that Z is a PID, and hence a UFD, we have proved that these 
properties hold. So we are now standing on firm logical ground. 0 


3.7 Euclid’s algorithm 


Let R be a Euclidean domain. In this section we present Euclid’s 
algorithm, a very effective method of first, finding g = gcd(a, b) for 
any two elements a and b of R, not both zero, and second, writ- 
ing g = ax + by with x and y elements of R. (Historically speaking, 
Euclid’s algorithm came first, and then we defined Euclidean domains 
to be the integral domains in which Euclid’s algorithm works.) We 
will develop this algorithm, give a variety of examples in various sit- 
uations, and show how to apply it in the Chinese remainder theorem. 

We fix a Euclidean domain R and a Euclidean norm 6 on R. Let 
a, b € R, not both zero. If b = 0 then a & gcd({a,b}) and if a = 0 
then b = gcd(a, b), by Lemma 3.5.20(b). Thus we have found the ged 
in these simple (and not very interesting) cases. The interesting case 
is when a and 0 are both nonzero. 


Algorithm 3.7.1 (Euclid’s algorithm). Let a, b € R be nonzero. 
Set a9 = a, a, = b. Then, by the definition of a Euclidean domain 
(Definition 3.4.1), there are elements q; and az of R with 


ao = 4191 + 42 with ag = 0 or d(ag) < d(az). 


Suppose ag # 0. Then, similarly, there are elements gg and az of 
R with 


a1 = 4292 + a3 with a3 = 0 or 5(a3) < d(ag). 
Suppose ag # 0. Then, there are elements g3 and a4 of R with 
a2 = a3q3 + a4 with ag = 0 or d(a4) < 5(a3). 


Continue ... 
Claim. This process cannot go on forever. 
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Proof of claim. If it did, we would have an infinite sequence 
@1,42,a3,... with 


6(a1) > d(a2) > d(a3) Se ene 


But each d(a;) is a nonnegative integer, and it is impossi- 
ble to have an infinite strictly decreasing sequence of nonnegative 
integers. 


Hence, this sequence stops at some stage n, 1.e., @n—1 = Angn + 
Gn41 With an, = 0. 
Let us write down what we have obtained: 


ag = a1q1 + a2 
a, = a2q2 + 43 


a2 = 4393 + G4 


An—3 = An—29n—2 + An-1 


An—2 = An—-19n—-1 + An 
An—1 = Gngn + 0 


Now we know (from Lemma 3.5.20(b)) that a, = gcd(dn, 0). 
But we also know 


gcd(an_1,@n) = gced(ap,0) by Lemma 3.5.20(c), 
gcd(an—2, @n—1) = gcd(an_1,a,) by Lemma 3.5.20(c), 
gcd(an—3, @n—2) = gcd(an_2,an-1) by Lemma 3.5.20(c), 


d(a3,a4) by Lemma 3.5.20(c), 
d(az,a3) by Lemma 3.5.20(c), 


gcd(az, a3) = ge 
gcd(a1, a2) = ge 
gcd(ao, a1) = gced(aj, a2) by Lemma 3.5.20(c), 
Following this chain from the bottom up, we see 
an = ged(an,0) = gcd(an—1, an) = ged(@n—3, @n—1) = gcd(an—z, @n—2) 


... & ged(az, a3) © ged(a1, a2) ¥ ged(ao, a1). 
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Thus, an = gced(ao,a;) and we have found a gcd of our original 


two elements of R. 
Furthermore, from the next-to-the last equation we see that 


an = An—2(1) + Gp=1(—Ga-i) 


and we have expressed a, in terms of ajn—2 and anj_;. But we can 
solve the equation above that one for a,_; and substitute: 


an = An—2(1) + [Q@n—3(1) + Gn—2(—Gn—2)|(—@n—1) 
On 3 — Gna) Oy + Gyo gaat }: 


We don’t need to keep track of these exact coefficients for the 
proof (though we will certainly need to do so in our computations). 
We just observe that we have now expressed a, in terms of a,—3 and 
Gn—2. We use the equation above that to express ay, in terms of an—4 
and anj—3. We continue to “roll up”. By the time we get to the first 
equation we have expressed a, in terms of a; and az, and when we 
have used that equation we have expressed a, in terms of ag and aj, 


gcd(a, b) = ayn = ax + by for some z,y € R, 
as desired. . 


Example 3.7.1. We let R = Z and do several examples of increasing 
complexity. 


(a) gced(33,10): 


33 = 10-343 
10=3-3+4+1 
3=1-3 


so 1 = gcd(33, 10), and then 


1 = 10 +3(—3) = 10 + [33 + 10(—3)](—3) = 33(—3) + 10(10). 
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(b) ged(1001,161): 
1001 = 161-6 +35 
161 = 35-4421 


35 = 21-1414 
21=14-14+7 
21=14-1 
14=7-2 


so 7 = gcd(1001,161), and then 
714) 
= 21 + [35 + 21(—1)](—1) = 35(—1) + 21(2) 
= 35(—1) + [161 + 35(—4)](2) = 161(2) + 35(—9) 
= 161(2) + [1001 + 161(—6)](—9) = 1001(—9) + 161(56). 


Note that if we only wanted to find gcd(1001, 161), we could have 
stopped with the second equation 161 = 35-4-+ 21 as we can see 
right away that 7 = gced(35, 21). 

(c) ged(4551,2501): 


4551 = 2501 - 1 + 2050 
2501 = 2050-1+ 451 
2050 = 451-4 + 246 
451 = 246-1+4 205 
246 = 205-1+ 41 
205 = 41-5 
so 41 © gcd(4551,2501), and then 
41 = 246 + 205(—1) 
= 246 + [451 + 246(—1)](—1) = 451(—1) + 246(2) 
= 451(—1) + [2050 + 451(—4)](2) = 2050(2) + 451(—9) 
= 2050(2) + [2501 + 2050(—1)](—9) = 2501(—9) + 2050(11) 
= 2501(—9) + [4551 + 2501(—1)](11) = 4551(11) + 2501(—20). 
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(d) ged(54321,12345): 
54321 = 12345 - 4 + 4941 
12345 = 4941 - 2+ 2463 
4941 = 2463-2+4 15 
2463 = 15-164+43 
15=3-5 
so 3 = gcd(54321,12345), and then 
3 = 2463 + 15(—164) 
= 2463 + [4941 + 2463(—2)](—164) = 4941(—164) + 2463(329) 
= 4941(—164) + [12345 + 4941(—2)](329) 
= 12345(329) + 4941(—822) 
= 12345(329) + [54321 + 12345(—4)](—822) 
= 54321 (—822) + 12345(3617). 


(ec) gced(618421,124816): 
Recall that the Euclidean norm on Z— {0} is given by 6(n) = |n|, 
so given a and b we want to express a as a = bqg+r with r = 0 or 
|r| < |b]. We have previously always chosen r > 0. But we don’t 
have to. In particular, we may always choose r so as to make 


|r| as small as possible. This can speed up computations. We do 
that here: 


618421 = 124816 - 5 + (—5659) 
124816 = (—5659)(—22) + 318 
5659 = 318(—18) + 65 
318 = 65(5) + (—7) 
65 = (—7)(—9) +2 
7 = 2(—3) + (-1) 
2 = (-1)(-2) 
so we see —1 = gcd(618421,124816), or, equivalently, 1 = 


gcd(618421,124816), i.e., 618421 and 124816 are relatively prime. 
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(Once again, we could have concluded this from the fourth equa- 
tion 318 = 65(5) + (—7) as we can see right away that 65 and —7 


are relatively prime.) 


Now, for the second step: 


S(=(27) 426) 


= (—7) + [65 + (—7)9](3) 


= 65(3) + [318 + 
= 318(28) + [—56 
= (—5659)(—137) 
= (—5659)(—137) 
= 124816(—2438) 


= 65(3) + (—7)(28) 
65(—5)](28) = 318(28) + 65(—137) 
59 + 318(18)](—137) 
+ 318(—2438) 
+ [124816 + (—5659)(22)](—2438) 
+ (—5659)(—53773) 


( 
= 124816(—2438) 
= 618421(- 


and also 


+ [618421 + 124816(—5)](—53773) 


53773) + 124816(266427) 


1 = 618421(53773) + 124816(—266427). 


(f) gcd(871,455,273): 


Here we use Lemma 3.5.21 (or Remark 3.5.22): 


ecd(871,455,273) 


= gcd(871, gcd(455,273)). 


We compute the inner gcd first. We use positive remainders, and 
leave it to the reader to check that we obtain 91 ~ gcd(455,273) and 
91 = 455(—1) + 273(2). We then compute gcd(871,91). Again we use 
positive remainders and we leave it to the reader to check that we 


obtain 13 ~ gcd(871,91) and 
13 = 871(2) 4 


13 = 871(2) + 91(—19). Finally, 
| 91(—19) 


= 871(2) + [455(—1) + 273(2)](—19) 


= 871(2) + 


+ 455(19) + 273(—38). © 


Example 3.7.3. We now let R = Z/i] and we recall that R has the 
multiplicative Euclidean norm 6(a + bi) = a? + b? (Theorem 3.4.7). 


We will follow the strategy of 
Euclid’s algorithm. 


the proof of that theorem in performing 
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(a) gcd(33 + 29%, 9 + 124): 
Let ag = 334297, ay = 94127. We observe that 6(a;) = 9?+12? = 
225. 
We compute 


334+ 297  33+4+291 9-121 645 — 1357 


94126 94121 9-12) ~— 225 
Now the nearest integer to “ is 3 and the nearest integer to 


=135 is —1, so we choose g, = 3+ (—1)i = 3—i. Then 


ay = A141 + G2 SO a2 = ag — a1q1 = (33 + 292) — [(9 + 127)(3 — 7)] 
= (33 + 29%) — (39 + 27%) 
= —-6+ 22. 


Thus the first step in Euclid’s algorithm is 
33 + 291 = (9 + 127)(3 — 7) + (-6 + 22). 


Note 5(a2) = (—6)? + 2? = 40 < 6(a) as we expect. 
Next we compute 


9+4+12i 9412 -6-2i  ~-30-90i 


642i -6412i -6-2i 40 
Now the nearest integer to — is —1 and the nearest integer to 


=90 is —2, so we choose gz = —1+ (—2)i = —1 — 24. Then 


a1 = a2qG2 + a3 SO ag = ay — A2qQ2 = (9 + 122) 
[(—6 + 22)(—1 — 2i)] 
= (9 + 12i) — (10 + 10%) 
=-14 21. 


Thus the next step in Euclid’s algorithm is 


9 + 124 = (—6 + 24)(—1 — 21) + (-1 + 24). 


Note 5(a3) = (—1)? = 5 < d(az) as we expect. 
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Next we compute 


—6 + 27 _ 24 -1— 2% 1 107 
pea (ae 
1+ 21 1+ 21 1— 22 5 


Thus, we see that —1-+ 27 divides —6 + 27, and we are done. 
We summarize: 
33 + 91 = (9 + 127)(3 — 2) + (—6 +4 27) 
9 + 12i = (—6 + 27)(—1 — 27) + (—1 +4 21) 
—6 + 127 = (—1 4 22)(2 + 27) 


( 
( 
and we see —1 + 21 = gcd(33 + 293, 9 + 21). 
Furthermore, 


142i = (9 + 12%) + (—6 + 24)(1 + 2%) 
= (9 + 12%) + [(33 + 291) + (9 + 124)(—3 + 4)](1 + 22) 
= (33 + 29i)(1 + 2%) + (9 + 12%)(—4 — 54). 


= 


ged(9 — 2i,4 + 74): 
Let ap = 9—2i, ay = 4+7i. We observe that 6(ao) = 97+ (—2)? = 
85 and 6(a,) = 47 + 7? = 65 


We compute 


9-2 9-21 4-71 22-711 


447 4471 4-7) 65 


Now the nearest integer to — is 0 and the nearest integer to — 
is —1, so we choose q, = —i. Then 


dg = 4141 + a2 SO ag = ag — aq, = (9 — 2%) — [(4 + Ti) (—2)] 
= (9 — 2%) — (7 — 4i) = 242i. 


Thus, the first step in Euclid’s algorithm is 
9 — 24 = (44 72)(—-7) + (24+ 2%). 


Note 5(a2) = 2? + 2? = 8 < 6(a,) as we expect. 
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Next we compute 


44+7 4471 2-27 22+61 


242i 2421 2-2% °&8 


Now the nearest integer to ~ is 3 and the nearest integer to 8 is 
1, so we choose gg = 3+7. Then 


a1 = A2q2 + a3 SO ag = a1 — Aoq2 = (44 Ti) — [(2 + 27)(3 + 2)] 
= (447i) — (448i) =i. 


Now we observe that —7 is a unit in Z[i]. So —7 certainly divides 
2+ 2i; indeed 2 + 2 = —i(—2 + 27) and we have: 
9 — 24 = (4+ 7i)(—i) + (2 + 2%) 
4+ 74 = (2+ 27)(3 + 7) + (-Z) 
2+ 27 = (—7)(—2 + 2%). 
More to the point, since —7 is a unit, we conclude that 9 — 27 and 


4+ 7i are relatively prime, 1 = gcd(9 — 27,4 + 7i). 
Furthermore, 


i = (44 74) + (2+ 2%)(-3 - 4) 
= (4+ 7i) + [(9 — 2%) + (4+ 7A)i](—3 — 3) 
= (9 — 2i)(—3 — i) + (4+ Ti)(2 — 34) 


1 = (9 — 24)(1 — 34) + (4 + 7é)(3 + 24). 


Note that if had just wanted to see whether 9 — 27 and 7+ 47 were 
relatively prime, we could have stopped this process earlier. Since we 
have a multiplicative norm, if g = gcd(a, b), then g divides a, so 0(g) 
divides 6(a), and g divides b, so 6(g) divides 6(b); hence 6(g) divides 
gcd(d(a),6(b)). In our case here, if g = ged(9 — 27,7 + 47) then also, 
from the next step, g = gcd(4+ 7i,2 + 27). But 6(4+4+ 72) = 65 and 
6(2 + 27) = 8, and these integers are relatively prime, so we must 
have 6(g) = 1 and hence g is a unit, and we may conclude without 
further ado that 9 — 22 and 4+ 77 are relatively prime. © 
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Example 3.7.4. We now consider the polynomial ring R[x], where R 
is a field, and recall that this ring has the Euclidean norm 46(p(x)) = 
deg p(x). 


(a) Let R = Q, the field of rational numbers. We wish to find 
gcd(a? +a+2,x2+22+3), and for this we use the usual division 
algorithm for polynomials. We have: 

a? +o +2 = (a? + Qe + 3)(2 — 2) + (22 +8) 
zg? +2¢ +2 = (22 + 8)(2/2-1) +11 
2/2—1=11(2/22 — 1/11). 


We note that 11 is a unit in Q/z], so these two polynomials are 
relatively prime, 1 & gcd(x? + 2 + 2,27 + 2x + 3). Furthermore, 


11 = (x? + 2x +3) + (22 + 3)(—2/2 +1) 
= (a? + 22 + 3) 4+ [(e? + 2 + 2) 
+(a? + Qe + 3)(—2 + 2)|(—2/2 +1) 
= (2° +2 +2)(—2/2 +1) + (x? + Qn + 3)(a7/2 — 22 +3), 


or 


1 = (a +a42)(—2/224+1/11)+(x?+22+3) (2? /22—22/11+3/11). 


ics 


Now let R = Zy1, the integers modulo 11. Since 11 is a prime, 
Z, is a field. Again we wish to find gcd(x#? + 2+ 2,27 +27 +38). 
To do so, we can reuse our work from the last computation. 


The first step is almost unchanged: 


ge +o¢4+2— (274+ 2243)(x4+9) + (Qr +8). 


The only difference is that —2 = 9 (mod 11), so we have replaced 
x—-2byx+9. 

The second step is more interesting. In our previous computation, 
when we divided x? + 22 + 3 by 2a + 8 we obtained a quotient of 
a/2—1 and a remainder of 11. Now the quotient «/2 —1 = $a — 
1 = 2-l(x) — 1. But in Zy,, 2-1 = 6 (as 2-6 = 1 (mod 11)), and 
—1= 10 (mod 11). Also, in Z;, 0 = 11 (mod 11), so the remainder 


Ring Theory 211 


is 0, ie., 2a + 8 divides x? + 22 + 3. Thus, Euclid’s algorithm gives 
here 


ge +o¢4+2= (27 +224 3)(x +9) 4+ (2 +8) 
a? +22 +3 = (22 + 8)(62 + 10) 


Thus, «+ 4 & ged(a? + & + 2,27 + 22 + 3). 
Furthermore, again using congruences (mod 11), 


Qn +8 = (2? +242) 4+ (x? + Qe + 3)(10x + 2) 


or 
+4 = (22 +24 2)(6) + (x? + 22 + 3)(52 4+ 1). % 


Now we will see how to apply Euclid’s algorithm in conjunction 
with the Chinese remainder theorem. To keep things relatively sim- 
ple, we will restrict our attention to R = Z. 


Example 3.7.5. 

(a) Let R = Z. We wish to solve the simultaneous congruences: 
x = 25 (mod 64) 
x = 49 (mod 121) 


Since 64 and 121 are relatively prime, the Chinese remainder 
theorem (Corollary 3.2.27) guarantees us that this system has a 
solution. We wish to find it, and to do so we adopt the strategy of 
the proof of Theorem 3.2.26. That is, we want to find an element 
hy of Z with hy = 1 (mod 64) and hy = O (mod 121), and an 
element hg of Z with hz = 0 (mod 64) and hz = 1 (mod 121). 
Then we obtain a solution 6 = 25h; + 49h2. How can we find hy, 
and hg? By Euclid’s algorithm! 

We apply Euclid’s algorithm to 121 and 64, and we obtain 


1 = 121(9) + 64(—17). 
Now certainly 121(9) = 0 (mod 121), as 121(9) is visibly divisible 
by 121. But the above equation gives the congruence 
1 = 121(9) + 64(—17) (mod 64). 
We see that the second summand 64(—17) = 0 (mod 64), as 


64(—17) is visibly divisible by 64. Thus, 121(9) = 1 (mod 64). 
Hence we may choose h; = 121(9) = 1089. 
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By exactly the same logic we see that 64(—17) = 0 (mod 64) 
and 64(—17) = 1 (mod 121), so we may choose hg = 64(—17) = 
—1088. Thus we have a solution b = 25(1089) + 49(—1088) = 
—26087. This is a perfectly valid solution, but we wish to be neat 
about this and get a solution between 0 and 7743=64-121-—1. 
(Remember that the Chinese remainder theorem tells us that 
our solution is not unique; it is only unique (mod 64-121).) A 
little arithmetic shows that —26087 = 4889 (mod 64 - 121), so 
our solution is 


x = 4889 (mod 7744). 


(b) Let R = Z. We wish to solve the simultaneous congruences: 


x = 6 (mod 21) 
x = 5 (mod 23) 
x = 4 (mod 25) 


Again, since {21,23,25} is a pairwise relatively prime set of inte- 
gers, the Chinese remainder theorem guarantees us that this sys- 
tem has a solution. We use the same method to find it. First, we 
want to find an integer h; with h; = 1 (mod 21), hy = 0 (mod 23), 
hy, = 0 (mod 25). Now h, = 0 (mod 23) means hy is divisible by 23 
and hy = 0 (mod 25) means hy is divisible by 25. Since 23 and 25 
are relatively prime, this means that hy, is divisible by their product 
23-25=575. Thus we apply Euclid’s algorithm to 21 and 575 to obtain 


1 = 575(8) + 21(—219) 
and by the same logic as before we see that if hy = 575(8) = 4600, 
hy = 1 (mod 21), hi = 0 (mod 23), hy = 0 (mod 25). 
Next we wish to find hz with hg = 1 (mod 23), hz =0 (mod 21), 


hz = 0 (mod 25). We apply the exact same logic to 23 and 21-25=425 
to obtain 


1 = 525(—6) + 23(137) 


and take hg = 525(—6) = —3150. 


Ring Theory 213 


Finally, we wish to find hg with hj = 1 (mod 25), hg = 
0 (mod 21), hs = 0 (mod 23), and we apply the exact same logic 
a third time to 25 and 21-23= 483 to obtain 


1 = 483(—3) + 25(58) 


and take h3 = 483(—3) = —1449. 

Then we obtain a solution b = 6(4600) + 5(—3150) + 4(—1449) = 
6054 and note that this solution is unique mod 21 - 23-25 = 12075, 
so the solution to our system of congruences is 


ax = 6054 (mod 12075). 6 


Although it is implicit in our previous work, it is worth explicitly 
writing out the method we have used in finding solutions to simulta- 
neous congruences. Again, for simplicity, we restrict ourselves to the 
case R = Z. 


Theorem 3.7.6. Let {m,,..., mn} be a set of pairwise relatively 
prime nonzero integers. Let M = my, ..., Mn and let My = M/mzx, 
k =1,...,n. Let x, and y, be integers with myx, + Mey, = 1 
and set hy =1—mpap = Myy, fork =1,..., 1. Let ry, ..., Tn be 
arbitrary integers. Then the system of simultaneous congruences 


x1 =r, (mod m4) 


Ln =Tn (mod m,) 


has the unique solution (mod M) given by 


L= Shere (mod M). 
k=1 


Proof. Note that m, and My, are relatively prime for each k, and 
so such integers x, and yz exist (and we can find them by using 
Euclid’s lemma). Then hy = 1 (mod mx) and hy = 0 (mod M;), and 
this latter condition implies hy = 0 (mod m,) for all 7 A k. Then it 
is easy to verify that x = X?_ herp (mod M) is indeed a solution of 
this system, and this solution is unique (mod M). 
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3.8 Applications to number theory 


This section consists of two independent parts. They are unified in 
that each is an application of our results in group and ring theory to 
prove a beautiful and important theorem in number theory. These 
theorems are Fermat’s theorem on the sum of two integer squares, 
and the Law of Quadratic Reciprocity. In each case the proofs we 
present are not the original proofs, but rather proofs that use ideas 
that were historically developed much later. 

Fermat’s theorem states that every prime p = 1 (mod 4) is a 
sum of two squares, p = a? + b? for some integers a and b. Actually, 
his theorem is more precise, and we will state and prove the precise 
version below. But here are some examples: 


5= 2? 41? 
13 = 3? +2? 
7=44+ 17 
625 a? 
s7=0 47 
41 =57 +44? 


99989 = 2307 + 217? 
618421 — 7862 + 252 


Indeed, the representation of p as a sum of squares is “essentially” 
unique. What do we mean by essentially here? Note, for example, we 
can write 


5 = 2? 41? = 2? 4 (-1)? = (-2)? + 1? = (-2)? + (-1)° 
= P42 =1? + (-2)* = (-1)? +2? = (-1)? + (-2)? 


and we want to consider these eight possibilities to be essentially the 
same, so essentially unique should mean up to the signs and order of 
a and b. 

What about primes p = 3 (mod 4)? We can rule these out immedi- 
ately, as it is easy to check that for any integers a and b, a?+b? = 0,1, 
or 2 (mod 4). 
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In fact, Fermat determined exactly which integers can be 
expressed as a sum of two squares. This follow easily from the above 
result, and we shall derive this, too. 

Fermat lived in the 17th century. He claimed this theorem, 
although he did not pass along this proof, but we believe him, and 
credit the theorem to him. He did describe his proof as being by his 
“method of descent” (a method that involves the contrapositive of 
mathematical induction) and Euler wrote down a proof along these 
lines in the 18th century. The proof we will give is that given by 
Dedekind in 1894. The key to this proof is a fact that on the surface 
of it appears completely unrelated to this theorem, the fact that Z/7] 
is a UFD! Given this fact, we will be able to prove this deep theorem 
in just a few paragraphs. 


Theorem 3.8.1 (Fermat). Let p be a prime with p = 1 (mod 4). 
Then p = a? + b? for some integers a and b, unique up to sign and 
order. 


Proof (Dedekind). Let R = Z/i] and recall we have the norm 6 
on R defined as follows: If z € R, z =7r+ si, then Z = r — si and 
6(z) = 22 =r? +8”. Recall that 6 is a multiplicative Euclidean norm 
on R and hence that R is a Euclidean domain, hence a PID, hence a 
UFD. 

As we have shown, since p = 1 (mod 4), —1 is a quadratic 
residue (mod p) (Corollary 2.7.13), i.e., there is an integer d with 
d? = —1 (mod p). Then d? + 1 = 0 (mod p), ie., d? + 1 is divisible 
by p. But d? +1 = (1 + di)(1 — di), a factorization in R. Thus, p 
divides this product, but does not divide either factor (as neither 
(1+di)/p nor (1—di)/p is in R), so p is not a prime in R. Hence, p is 
not irreducible in R (as in any UFD, primes and irreducibles are the 
same), so p = rs for some r,s € R, neither of which is a unit. But 
then 6(p) = 6(rs) = 6(r)6(s), ie., p? = 6(r)6(s). Since r and s are 
not units, d(r) € 1 and 6(s) 4 1 (Remark 3.4.8). Thus we must have 
6(r) = 6(s) = p. Setting r = a+ bi, 5(r) =a? +b’, sop =a? +0? for 
some integers a and b (and you can check then that we must have 
s =7 =a — bt), so, as far as existence goes, we are done! 

Now for essential uniqueness. Suppose p = a? + b? = c? + d?. 
Observe that all of a,b,c, and d are nonzero (as otherwise p would be 
a perfect square). Let r = a+bi and s = c+di so p* = 6(p) = 6(r)d(s) 
and 6(r) = 6(s) =p. We must have r and s irreducible (as if r = tu, 
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p = 0(r) = 6(t)d(u), so either 6(t) = 1 and ¢ is a unit, or 6(u) = 1 and 
u is a unit, and similarly for s). Once again, R is a UFD so the irre- 
ducibles in R are the same as the prime in R. Thus, r is a prime and r 
divides rr = p = s5, sor must divide s or r must divide 5, i.e., s = rv 
or §=rv for some v € R. Since 6(r) = 6(s) = 6(S5), 6(v) =1 and v 
is a unit. Now the units in R are {+1,+i}. Thus, we have two pos- 
sibilities (r divides s or r divides 5) and for each possibility we have 
four choices for v, giving a total of eight possibilities, and direct cal- 
culation shows these eight possibilities just give all the possible signs 
and order of a and b, so we have essential uniqueness, and we are 
done! 


We now give a proof, from our viewpoint, of a result that was 
known to Diophantus (3rd century C.E.). 


Lemma 3.8.2. Let m and n be positive integers, each of which is a 
sum of two squares. Then their product mn is a sum of two squares. 


Proof. Suppose m = a? + b? and n = c? + d?. Set r =a+ bi and 
s =ctdi. Then m = r7 and n = 85,80 mn = (r7)(s3) = (rs) (75) = tt 
with t= rs. Thus if t = e+ fi, mn = e? + f?. 


Remark 3.8.3. This proof gives an explicit formula. We have t = 
e+ fi = (a+ bi)(c+ di) = (ac — bd) + (ad + bc)i so e = ac — bd and 
f =ad-+ bc. Thus, we obtain the algebraic identity 


(a?+b*)(c? +d?) = (ac—bd)?+(ad+bc)?. .) 


Corollary 3.8.4 (Fermat). Let N be a positive integer. Then N is 
a sum of two integer squares if and only if the highest power of any 
prime q = 3 (mod 4) dividing N is even. 


Proof. If N =1, N =1?+40?, so the corollary is true for N = 1. 
If N =2, N =1?+41?, so the corollary is true for N = 2. 
Suppose JN is as stated. Then N factors as a product of distinct 

prime powers 


; 2 20k 
N = 2p! ++ phi gi + go 


where pi, ..., pj = 1 (mod 4) and q, ..., qx =3 (mod 4). Let 


f 


M = 2°pi eee P; 
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and 


Qa -- git 


so that N = MQ?. 

Now, as we have just observed, 2 is a sum of two squares, and 
by Fermat’s theorem, each of pj, ..., pj is a sum of two squares. 
Thus, applying Lemma 3.8.2 repeatedly, M is a sum of two squares, 
M =a? +0?. But then 


N = MQ? = (a2 + 82)Q? = (aQ)? + (bQ)? 


is a sum of two squares as well. 

Now suppose N is a sum of two squares. 

We make a preliminary observation: 

Let gq be a prime with g = 3 (mod 4). If u and v are integers not 
divisible by g, then u? + v? is not divisible by q. For if u? + v? = 
0 (mod q), then u? = (—1)v? (mod q). Now, since g = 3 (mod 4), —1 
is not a quadratic residue (mod q) (Corollary 2.7.13). Then, on the 
one hand, from the left-hand side of this congruence we evidently see 
that u? is a quadratic residue (mod q), while from the right-hand 
side of the congruence we see, with the help of Lemma 2.7.10, that 
it is not; contradiction. 

Now suppose N = 2? + y? for some integers x and y. Let g be a 
prime with g = 3 (mod 4) that divides N. If x = 0 or y = 0, then N 
is a perfect square, so certainly the highest power of qg dividing N is 
even. Suppose not. Write x = q°u and y = q/v, where u and v are 
not divisible by g. We may assume that e < f (otherwise interchange 
x and y). Then 


N= 2 dug? = mets + qfy? = or ie qty?) 


If e < f then q?/~¢ is divisible by q, so the parenthesized expres- 
sion is congruent to u? (mod q); since u # 0 (mod q), u? 4 0 (mod 
q), so this expression is not divisible by g and the highest power of q¢ 
dividing N is q?°, an even power. 

If e = f then the parenthesized expression is u? + v?, and by our 
earlier observation u? + v? is not divisible by q, so the highest power 
of q dividing N is q?°, again an even power. 

Thus, in any case, if N is a sum of two squares, the highest power 
of q dividing JN is even, as claimed. 
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Next we come to the Law of Quadratic Reciprocity. Although 
we will not be using it again in this book, this is one of the great 
theorems of number theory, and every student of mathematics should 
see it (at least) once. 

The Law of Quadratic Reciprocity was first proved by Gauss in 
his Disquisitiones Arithmeticae, published in 1801. Gauss returned 
to this theorem many times in his life, and produced a total of seven 
different proofs. It has been reproved many times, by many different 
people, since. We will present a proof due to Zolotarev from 1872. 
The basic idea of this proof is to look at signs of permutations. The 
lemma that starts us off, Zolotarev’s lemma, has a very easy proof, 
but, as you will see, the proof of the Law of Quadratic Reciprocity 
that we give is rather tricky- this deep theorem has no really easy 
proof. 

Let us fix an odd prime p. We recall from Definition 2.7.8 that 
we have the quadratic residue character on the multiplicative group 
Zy, defined by xp(a) = 1 if a is a quadratic residue (mod p) and 
Xp(a) = —1 if a is a quadratic nonresidue (mod p). (In comparison 
to that definition, we are simplifying our notation here by writing a 
instead of [a]. We will continue to use this simplification throughout 
this section.) We also recall from that definition that we have the 
Legendre symbol (2 defined by (F) = xp(@). 

Recall from Corollary 3.3.11 that Z> is cyclic. A generator of Z;, 
is called a primitive root (mod p). For example, 2 is a primitive 
root (mod 5) as the powers of 2 (mod 5) are {1,2,4,3}. It is not a 
primitive root (mod 7) as the powers of 2 (mod 7) are {1,2,4}, but 3 is 
a primitive root (mod 7) as the powers of 3 (mod 7) are {1,3,2,6,4,5}. 

Here is the lemma that starts the ball rolling. 


Lemma 3.8.5 (Zolotarev’s lemma). Let p be an odd prime. Let 
a be relatively prime to p, and let fig:Z, — Zy be the permutation 


Ha(x) = ax (mod p). Then xXp(a) = sign(Up). 


Proof. Let r be a primitive root (mod p). Then a = r* (mod p) 
for some k, sO fg = pl, But pu,.% = (fur)* (where by power we mean 
composition), so sign(Ja) = sign(u,)*. But also yp(a) = xy,(r*) = 
Xp(r)*. Thus we need only show x,(r) = sign(s-). 

On the one hand, y,(r) = —1 as r is a quadratic nonresidue, 
being an odd power (the power 1) of the generator r, and hence a 
nonsquare in the group of even order Z>. 
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On the other hand, let us write the elements of Z} in the 
order (lrr?... r?-?). Then we see p,-(1) = r,u-(r) = r,... 
pr(r?-2) = rP-! = 1. In other words, p, is a single (p — 1)- 
cycle. But p— 1 is even so this is an odd permutation, ie., 


sign(u,) = —1. 


Corollary 3.8.6. Let p be an odd prime. 


(a) Ifa is relatively prime to p, and [q: Zp + Zy is the permutation 
Ha(x) = ax (mod p), then Xp(a) = sign(Up). 

(b) For any a, if aq: Zp + Zp is the permutation ag(x) = a+ 
x (mod p), then sign(aq) = 1. 


Proof. 


(a) fa on Z, only differs from fq on ZF by the fact that pa(0) = 0, 
i.e, we have an additional l-cycle, and that does not affect 
the sign of a permutation, so this follows immediately from 
Lemma 3.8.5. 

(b) This is certainly true for a = 0, as then ag is the identity. For 
a # 0, note that a, = (a1)* (where by power we again mean com- 
position). Writing the elements of Z, in the order (012... p—1), 
we see that a;(0) = 1,a,(1) = 2,..., ai(p — 1) = 0. In other 
words, @, is a single p-cycle. But p is odd so this is an even 
permutation, i.e., sign(a,) = 1. 


Theorem 3.8.7 (Law of Quadratic Reciprocity). Let p and q 
be odd primes. Then 


(=e 


Proof (Zolotarev). We stated this in terms of Legendre symbols, 
as this is most commonly done, but we will use quadratic residue 
characters in our proof. 

We begin by considering the ring Z, x Zz = {(a,b) |O<a< 
p—1, 0<b<q-—1}. We write the elements of this ring in a p- 
row, qg-column rectangle, rows numbered 0, ..., p — 1 and columns 
numbered 0, ..., g—1, so that the entry in row a, column 8, is (a, 6). 
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0 (0,0) (0,1) (0,2) eae (0, q— 1) 
2 q 5 5 3 
p-1|(p-1,0) (p-1,1) (p-1,2) ... (P—1,¢-1) 


We recall that we have an isomorphism of rings (and hence a 
bijection) y: Zpg + Zp x Zq given by 


y(c) = (c (mod p), ¢ (mod q)). 

We first consider the permutations on this array defined by 

C(a,b) = (qa +b, b) and O(a, b) = (a,a + pb). 
We note that 

y(qa + b) = (qa + 0,6) and y(a + pb) = (a,a 4+ pb) 
and so 

y ‘C(a,b) = qa+b and y '6(a,b) = a+ pb. 

These are both bijections, so we may consider the bijection 
o=(~ ‘(ey 0)". 


Then o(y10) = y~'¢ so we see that o: Zpqg 4 Zygq is the bijection 
(i.e., permutation) 


ao(a+ pb) =qa+t+b. 
Now from the equation o(y~ 10) = ~~ !¢ we see 
sign(c) sign(y ') sign(@) = sign(y *) sign(¢) 
sign(c) sign(@) = sign(¢) 


and since the sign of a permutation is +1, 
sign(@) sign(¢) = sign(c). 


We now evaluate each of these three signs. We begin with 0. 
Note that @ leaves each of the rows in this array fixed, and per- 
mutes the entries within the rows. Let 6, be the permutation on 
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row a of the array, so 0 = 696; ... 8)-1 and hence sign(@) = 
sign(0)sign(1) ... sign(#,-1). Now for each a, 0, is the permuta- 
tion on Z, given by @, = gp where ag and py are as in Lemma 
3.8.6. Thus, sign(6,) = sign(aq)sign(Up) = Xq(p) by Lemma 3.8.6, 
and then sign(#) = xq(p)? = xq(p) as p is odd. By exactly the same 
logic, sign(¢) = xp(q). Thus, we see that the left-hand side of this 
equation is xq(p)xXp(q). To complete the proof we now show that 


p-1 q-1 


sign(o) = (-l1) 22. 

We determine sign(a) by counting inversions. Let us order the 
elements of the above array by columns. This gives an ordering of Zpg 
by counting down the columns. i.e., (0,0) © 0,(1,0) o 1,..., (p— 
1,0) p= 1, (0,1) 5, De ee Lng om Lg 1) Spe — 1. 

Note that under this ordering 


(a,b) @ a+ pb. 


In other words, if c € Zpg isc = at+pb,0<a<p—-1,0<b<q-l, 
then c is the cth element in this ordering. 

Suppose instead we decide to order the elements of this array by 
rows. Then (0,0) © 0, (0,1) <1, ... are under this ordering 


(a,b) qa+b. 


In other words, if d € Zpg isd = qa+b,0 <a<p—-1,0<b<q-l, 
then d is the dth element in this ordering. 

Now @ is given by o(a + pb) = qa +b. So we can describe o as 
follows: Let c € Zpg. Count down the array by columns 0,1, ..., 
until we reach position c. Then count 0,1, ... across the array by 
rows until we reach the c-th element in our count. Suppose that the 
element in that position is d. Then o(c) = d. 

Now to count inversions. An inversion occurs when c’ precedes 
c in our ordering, but when o(c) = d precedes o(c’) = d’ in our 
ordering. In our ordering, c’ precedes c when we get to c’ in counting 
by columns before we get to c, or, equivalently, when c’ is anywhere 
in a column to the left of where c is, or in the same column as c but 
above c. In other words, if ¢ is in position (a,b) and c’ is in position 
(a’,b’), c’ precedes c if a’ <a, or if a’ =a and dD! <b. 

Consider these possibilities for c’. We now make the key observa- 
tion that if c’ is in row b or above, i.e., if b’ < b, then we will reach 
zx’ in counting by rows before we get to c, i.e., in these cases d’ will 
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precede d in our ordering — no inversions. But if c’ is in a row below 
row b, then we will reach c’ in counting by rows after we get to c, 
i.e., in these cases d’ will follow d in our order-all inversions. 

Thus, we see that for any (a,b), we get an inversion from (a’, b’) 
when (a’,b’) lies in a corner of our array strictly to the left of and 
below (a, 0): 


0 

a e (a,b) 
a+ljile e e 

p-lle ie e 


Thus, we simply need to count the number of these pairs 
((a’, b), (@,0)). 

We will do so by moving along rows. 

Suppose we are in row 0 (i.e., a = 0). If we are in column 0, there 
are no entries below and to the left. If we are in column 1, there is 
a single column to the left, which has entries in rows, 1, ..., p—1 
below our entry, i.e., a total of p — 1 entries. If we are in column 
2, there are two columns to the left, each with (again) p — 1 entries 
below our entry. Proceeding in this way, we see we obtain a total of 
(= la 2p ip see tg i) pL a 2 ee op 
q—1) = (p— 1)(¢ — 1)(q)/2 inversions. Now for row 1. Again we 
get nothing from column 0. Now we get 1 fewer entry, i.e., p — 2 
entries, below and to the left of the entry in column 1, 2(p — 2) 
entries below and to the left of the entry in column 2, etc., for a total 
of (p—2)--2(p—2)4 > Ip 2) = (p22 eg) = 
(p—2)(q—1)(q)/2 inversions. Proceeding in this way we see we obtain 
a total of (p—1)(q—1)(q)/2+ (p—2)(q—1)(g)/2+ +--+» +(g—-1)(q)/2 = 


= —1). E 
pip) . Ce) inversions. Hence, 


sien(o) _ (ls ee = (yn) = ayes 


as p and q are both odd. 
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Remark 3.8.8. As the determination of the permutation o, and 
the computation of its sign, in this proof were rather tricky, we will 
illustrate these by an example. We take p = 5 and q = 3 and write 
the column ordering in the respective positions. We have: 


0 5 10 
1 6 il 
2 7 12 
3 8 13 
4 9 14 


Then we see o(0) = 0,0(1) = 5, 0(2) = 10, 0(3) = 1,0(4) =6,... 
Writing the elements in this array (rather than linearly) for clarity, 


we see: 
0 5 10 0 11 8 
1 6 Il 59 2 13 
o 2 7 12 =|10 7 4 
3 8 13 1 12 9 
4 9 14 6 3 14 
and we see we have inversions (5,1), (5,2), (5,3), (5,4), (10,1), (10,2), 
(10,3), (10,4), (10,6), (10,7), (10,8), (10,9), (6,2), (6,3), (6,4), ..., 


(13,4), (13,9), as in the proof. ?) 


Corollary 3.8.9. Let p and q be odd primes. 


(a) If at least one of p and q is congruent to 1 (mod 4), then either 
both p is a quadratic residue (mod q) and q is a quadratic residue 
(mod p) or both p is a quadratic nonresidue (mod q) and q is a 
quadratic nonresidue (mod p). 

(b) If both p and q are congruent to 3 (mod 4), then either p is a 
quadratic residue (mod q) and q is a quadratic nonresidue (mod 
p), or p is a quadratic nonresidue (mod q) and q is a quadratic 
residue (mod p). 


Proof. This is simply a restatement of the Law of Quadratic 
Reciprocity. 
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We close this section with a Zolotarev-style proof of Gauss’s 
lemma (Lemma 2.7.14). 


Lemma 3.8.10 (Gauss’s lemma). Let p be an odd prime and let 
a be an integer that is relatively prime to p. Let 


T={7|1<71<(9-1)/2 and 
ai =k (mod p) for some k with (p+1)/2<k<p-—l1}. 


Let t = #(T). Then (4) (ay 


Pp 
Proof. Write Z5 = {1,2,..., p—1} in two rows in the following 
order 
1 2 ... (p—1)/2 
(p—1) (p—2) (p+1)/2 


Note that 7 and j are in the same column if and only if 7+ 7 = 0 


(mod p). Let fia: Z, — Zp be as in Zolotarev’s lemma. Then (4) = 


Xp(@) = sign(a). We will calculate sign(j1¢) from its action on this 
array. 

Note that if i+ 7 = 0 (mod p), then q(t) + Wa(J) = 0 (mod p), so 
we see that fq permutes the columns of this array, and may or may 
not interchange the entries in a column. Thus we may write UW, = To, 
where o is the permutation on the columns, keeping the entries in 
every column in their same positions, while 7 fixes each column, but 
interchanges the entries in a column exactly when o does. Note this 
happens precisely for column 7 when 7 is in the set TJ’. Thus, 7 is a 
product of ¢ transpositions. 

Now we may write 0 as 0102 where oj is the action of o on the 
first row and o9 is the action of o on the second row. But, since a 
preserves columns, these actions are exactly the same. Thus we see 
La = TO = T0102 SO 


sign(Wq_) = sign(T)sign(o1)sign(o2) = sign(r)sign(o})? = sign(T), 


i.€., 


as claimed. 
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3.9 Some examples of integral domains 


In this section we first want to look a little deeper at Z|], where we 
do have unique factorization, and then look at some examples where 
we don’t. 

We begin with a lemma that we will use in several cases. 


Lemma 3.9.1. Let D be a squarefree integer. Let R = O(VD) and 
let 6 be the multiplicative norm on R given by 6(a+bVD) = |a? — 
b?D|. 


(a) Ifa is an element of with 6(a) = p, where p is a prime, then a 
is irreducible in R. 

(b) Suppose that p and q are prime (perhaps q = p). If R does not 
have an element of norm p, and a is an element of R with 6(a) = 
pq, then a is irreducible in R. 


Proof. Let a = bc. To show that a is irreducible, we must show 
that b or c is a unit. 


(a) If 6(a) = p, then 6(bc) = 6(b)d(c) = p, so 6(b) = 1, in which case 
b is a unit, or 6(c) = 1, in which case c is a unit (Lemma 3.4.8). 
(b) If 6(a) = pq, then 5(bc) = 6(b)d(c) = pg. Since R does not have 
an element of norm p, we cannot have 6(b) = p or 6(c) = p. Thus, 
we must have 6(b) = 1 or 6(c) = 1, in which case 6 or c is a unit, 
as in part (a). 


Now we determine the distinct primes in Z[7]. Recall that two 
primes are said to be distinct if they are not associates of each other. 


Theorem 3.9.2. Let R = O(./—1) = Ziti]. The distinct primes in 
R are: 


(i) 1+ and its associates. (Note that 1—i = —i(1 +1) is an 
associate of 1+i, and —i(1 +i)? =2.) 
(ii) Let p be a prime congruent to 1 (mod 4) and let a and b be 
integers with a? + b? = p. 
(ii(a)) a+ bi and its associates. 
(ii(b)) a — bi and its associates. 
(iii) For p a prime congruent to 3 (mod 4), p and its associates. 


Proof. First we recall that, since Z[i] is a UFD, primes and irre- 
ducibles in Z/i] are the same. 
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If a € Zi] is as in (i), (ii(a)), (4i(b)), or (iii), it follows immediately 
from Lemma 3.9.1 that a is irreducible, and hence prime. 

Now let a € Z|i] with d(a) = gq. If gq = 2 or p for p a prime 
congruent to 1 (mod 4), or p? for p a prime congruent to 3 (mod 
4), then a must be one of (i), (ii(a)), (ii(b)), and (iii). Suppose not. 
Write 6(a) = qor where qo is of this form. Then a@ = gor, r > 1. 
But qo is divisible by some prime element z of Z[7], so z divides a 
or a. If z divides a, switch z and z. Thus, we may assume z divides 
a, soa = zw for some w € Zi]. But then d(a) = 6(z)d(w), ie., 
gor = 0(z)d(w), with d(z) dividing qo, so 6(w) > 1 and w is not a 
unit. Hence, a is not irreducible, so is not prime. 


Now we turn to some examples where things go wrong. 


Example 3.9.3. Let R be a field and let S = R[{x!/"},,en] be the 
ring of Example 3.2.17(c). As we observed there, S is not Noetherian. 
We claim that the element x of S is not divisible by any irreducible. 
It is easy to check that if p(x) is an element of S with at least two 
terms, and q(x) is any nonzero element of S, then p(a)q(x) has at 
least two terms. Thus, the only divisors of x have a single term, so 
must be raz? for some r 4 0 in R (a unit in S) and some positive 
rational number q. But 2% = (x4/?)? so x4 is not irreducible. We also 
see that S is not a GCD domain: The set {x7 | ¢ € Q,q > V2} does 
not have a GCD. On the other hand, S is an f-GCD domain. We can 
see this as follows: Let {pi(x), ..., px(x)} be a finite set of elements 
of S', not all zero. Then there is some integer N such that every term 
in each of these polynomials has exponent of x that is an integer 
multiple of z!/%. Substitute y = x'/" to obtain a set of polynomials 
{pi(y), ---, pe(y)} in the polynomial ring R[y]. Now Rly] is a GCD 
ring, so this set has a ged g(y). Now substitute back 2!/N = 
obtain that g(a!/) is a gcd of {p1(x), ..., pe(x)}. .) 


Example 3.9.4. Let R be a field and let S = R[x”, x°] be the ring 
of polynomials in the variables x? and x? with coefficients in R. Note 
that every power x* with k > 2 is in S (24 = 2? -2?,25 = g?. 
x, etc.). Thus, we may alternatively write S as the subring of R[z] 
given by 


2 
S = {ap + ax” + 43° + +++ +anz” | ag, a2, a3, ..., Gn € R}, 
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i.e., the subring of R[z] consisting of all polynomials that do not have 
an “x” term. 

First. we observe that x? and 2° are both irreducibles in S. (Any 
factorization of x? or 2° would have to have an “x” term.) Then 
g® = g?.g?. 2? = x - x are two distinct factorizations of x® into 
irreducibles, so S is not a UFD. Also, observe that the divisors of x° 
(ignoring unit factors) are {1,2?,23} and the divisors of «® (again 
ignoring unit factors) are {1,x?, 2°, 2*}, so the set of common divisors 
is {1,27,x°}. Hence the elements x° and x® do not have a ged (as 
neither of x? nor x? divides the other in 9), so S is not an f-GCD 
domain. 

But S is Noetherian, as we see from the following argument: Let 
I be a nonzero ideal in S. Let d be the smallest degree of a nonzero 
polynomial in J and let pg(z) be a monic polynomial of degree d in I. 
Note that pg(x) is unique as if not, and p/,(x) were some other, then 
pa(x) —p',(x) would be a polynomial of lower degree in J; impossible. 
Now J may or may not contain a polynomial of degree d+ 1. If 
not, do nothing. If so, let pg;1(x) be a monic polynomial in J with 
the coefficient of 2? in pay1(2) equal to zero (which we can always 
arrange by subtracting an appropriate multiple of pq(x)), and by 
the same argument pa+1(x) is unique. We claim that I is generated 
by the single polynomial pq(x) in the first case, or by the pair of 
polynomials pq(x) and pa+i(x) in the second case. We prove this by 
induction on the degree n of any nonzero polynomial q(x) in J. If 
n = d, then q(x) must be a multiple of pg(x). If n = d+1, then there 
are no such polynomials in the first case, and any such polynomial 
is easily checked to be a sum of multiples of pa(x) and pa+4i(x) in 
the second case. Now assume the result is true for any polynomial 
of degree < n, and let g(x) be a polynomial of degree n > d + 2. 
Then q(a) = x?r(x) for some polynomial r(x) of degree n — 2. Then 
r(x) € I by the inductive hypothesis, and J is an ideal of S, so 


q(x) = x*r(x) € I, and by induction we are done. .) 


3 


Example 3.9.5. 


(a) Let R = O(./—5) = Z|V—5]. Recall that R has the multiplicative 
norm 6(a + b\/—5) = a? + 5b. We can easily make a table of 
elements of R with small norm: 
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Norm Elements of R with this norm 
+1 


+ + /—5), +(1 oa /—5) 


CONDOR WNEH 


No) 
w 


(2+ V=8),£(2 - v=B) 
10 none 


Then we have factorizations: 


6=2-3=(1+V-5)(1— V5) 
9=3-3=(2+/-5)\2—v-5) 


We observe that these are factorizations into irreducibles by 
Lemma 3.9.1. Thus both of these two elements have two distinct 
factorizations into irreducibles, and hence R is not a UFD. 

Let R = O(/—6) = Z[V—6], with multiplicative norm 6(a + 
b\/—6) = a? + 6b. We again make a table of elements of R with 


small norm: 
Norm Elements of R with this norm 


1 +1 


+(1 + /—6), +(1— /—-6) 


9 =o: 
10 t(2 + /—6), +(2 — /-6) 


Then we have factorizations: 


COND OK W WD 
=) 
° 
=) 
ie) 
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Again, these are factorizations into irreducibles by Lemma 3.9.1. 
Thus, both of these elements have two distinct factorizations into 
irreducibles, and hence RF is not a UFD. 

(c) Let R = O(V10) = Z[V10], with multiplicative norm 6(a + 
b/10) = |a? — 1002]. 


We have the factorization 
10 = 2-5 =(v10)(v/10). 


We observe that 2 is not prime, as it divides the product 10 = 
(V10)(10) without dividing either of the factors. We observe that 
2 has norm 4. We claim that R does not have an element of norm 2. 
For suppose x + yV 10 € R with 6(a + yV10) = 2. Then 

x = 10? = EZ 
a? = +2 (mod 5) 


which has no solution. 

Then by Lemma 3.9.1(b) 2 is irreducible. 

We also observe that 5 has norm 25. We claim that R does not 
have an element of norm 5. For suppose z+ yV10 € R with 5(a@ + 
yV10) = 5. Then 


x = 10y? = +5 
a? = +5 + 10y* = 5(2y* +1) 


Thus, x must be divisible by 5, and then 2? is divisible by 25, so 


2y?+1 = 0 (mod 5) 


which has no solution. 

Then by Lemma 3.9.1(b) 5 is irreducible. 

Also, since 10 has norm 10, again, by Lemma 3.9.1(b), V/10 is 
irreducible. 

Thus, 10 has two distinct factorizations into irreducibles and 
hence R is not a UFD. © 


These three rings are Noetherian, however, as we now see. 


Lemma 3.9.6. Let D be a squarefree integer and let R = O(VD). 
Then every nonzero ideal I of R is of one of the following two 
forms: 
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(i) I is a principal ideal generated by a single element a of R. 
(ii) I is an ideal generated by an element a of R and an integer g 
with g dividing 0(a). 


Proof. Let J be a nonzero ideal of R. If J is principal, we are done. 
Suppose not. 

Let z be any element of J, z 4 0. Then J contains the integer zz, 
and it contains zZV/D as well. Let 


S, = {|k| 40 | k is an integer in I} 
Sy = {|n| £40 | m+nvD is in I for some m}. 


Now Sj is a nonempty set of positive integers, so has a smallest 
element ko. Also, S'2 is a nonempty set of positive integers or half- 
integers, so has a smallest element no. Let a = mo + noVD € I. 

Now let b = m-+nvD be any element of I. We claim that n is an 
integer multiple of ng. We can write n = nog +r with 0 < r < no. 
(If no is an integer, this is just the division algorithm. If no is a half- 
integer, this is true as well-just apply the division algorithm to 2n and 
2no). Then b — aq =m! + rVD for some m’, and with 0 < r < no. 
But 6b — aq € I, so by the minimality of ng we must have r = 0. 
Thus we see that n is always divisible by ng. But then b — aq = m’. 
Thus m’ € Sj, so again by the division algorithm we must have that 
m’ is an integer multiple of ko, m’ = jko for some integer 7. Thus, 
b =aq+jko so I is generated by the two elements a and ko. 

Now to finish the proof we “neaten up”. Since a € I, d(a) = a@ € 
I. Since 6(a) and ko are integers in J, g = gcd(d(a), ko) is in I (as 
g = 6(a)s + kot for some s, t € Z, and Z C R). Now kp is a multiple 
of g, so if I is generated by a and kg it is also generated by a and g, 
and, finally g certainly divides d(a). 


3.10 Quotient fields and localization 
To motivate our constructions in this section, let us look at a few 


examples of rings. 
First we have the ring (indeed, the field) of rational numbers 


Q= {a/b | a,b€Z,b #0}, 
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which we are familiar with. Here are a couple of less familiar exam- 
ples. Fix a prime p € Z. We have the rings 


R= {a/b | a,b € Z,b 40,6 a power of p}, 
S = {a/b | a,b € Z,b 40,6 relatively prime to p}. 


In each case, what have we done? We have chosen a “suitable” 
subset of Z that we will allow as denominators in our fractions. Suit- 
able means that we want the result to be a ring, so that it must be 
closed under the operations of addition and multiplication. How do 
we do addition and multiplication in Q, R, or S? By the “usual” 
rules for adding and multiplying fractions. 

But actually we have jumped the gun, because we have already 
written the elements of Q, R and S as a/b, which presumes that we 
already know how to do some arithmetic. If we think more basically 
about this, we should really write them as ordered pairs (a,b), and 
then mimic the “usual” operations of arithmetic on these ordered 
pairs. But if we are careful to do so, the first thing we run up against 
is the fact that different ordered pairs can represent the same fraction 
(eg., 1/2 = 2/4 = 3/6 ...). So what we should do is to put 
a relation (in fact an equivalence relation) on ordered pairs, saying 
that two ordered pairs (a,b) and (c,d) are equivalent if a/b = c/d. 
But of course we can’t phrase it that way, as that would be going 
around in circles, since we are trying to define a/b and c/d. How can 
we do this in a noncircular fashion? Again we can think about when 
“usual” fractions a/b and c/d are equal, and remember that this is 
true when ad = bc. With all this in mind, we can now proceed in a 
very straightforward way. 


Definition 3.10.1. Let R be an integral domain and let A be a 

nonempty subset of R that is closed under multiplication (i.e., if 

a, € A and ag € A then aj,az € A), and assume that 0 ¢ A, 1 € A. 
Define a relation ~ on R x A by 


(1,41) ~ (2,42) if ria2 = rea). 0) 


Lemma 3.10.2. The relation ~ is an equivalence relation on Rx A. 


Proof. We leave this as an exercise for the reader. 
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Definition 3.10.3. Let R,4 be the set of equivalence classes of R x A 
under the relation ~. Ry is called the localization of R away from 
A. We write the equivalence class of (r,a) as r/a. 


Lemma 3.10.4. The operations of addition and multiplication on 
Ra given by 


ai ie a2 aire + aor 
rl r2 rr 
ay ag ayag 


ry T2 Tyre 


are well defined. 


Proof. These are operations on equivalence classes, so we must 
check they are independent of the choice of representatives. Again 
we leave this for the reader. 


Theorem 3.10.5. In the above situation, Ra is an integral domain. 
The zero element of Ra is 0/1 and the identity element of Ra is 1/1. 


Proof. We must verify that the operations on R, satisfy all the 
properties of addition and multiplication in an integral domain. 
Again we leave this for the reader. 


Remark 3.10.6. We regard R as a subset of Ry by identifying r € R 
with r/1 € Ra. v 


Lemma 3.10.7. R%, the units in Ra, is R%4 = {a1/az | a, € A}. 
In particular, if A= R—{0}, R4 = Ra— {0}, so that in this case 
Ra is a field. 


Proof. Again we leave this for the reader. 


Definition 3.10.8. In the above situation, if A = R— {0}, Ry is 
called the quotient field of R. % 


Example 3.10.9. 


(a) If R = Z, the quotient field of R is Q. 

(b) If R= O(VD), the quotient field of R is Q(VD). 

(c) Let R be a field. Then the quotient field of the polynomial ring 
R[x] is {p(x)/q(x) | p(x), a(x) € Riz], q(x) A Of. This field is 
the field of rational functions in x with coefficients in R. 
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(d) More generally, if R is an integral domain with quotient field S, 
the quotient field of R[x] is the field of rational functions in x 
with coefficients in S. 

(e) Let R be a field. Recall that R[[x]] is the ring of formal power 
series in x with coefficients in R, 


R{[x]] = {Doone | a € a 
n=0 


As we have observed, Ri[x]|* = {5°29 | an € R, ao # O}. 
Then the quotient field of R[[2]] is 


{> oo | eh © 
n=—N 


Remark 3.10.10. We have restricted our attention here to integral 
domains for simplicity (and because it is the only case we will need). 
But the construction of R4 goes through more generally. We can 
begin with R any commutative ring with 1. We just need to require 
that the subset A does not contain any zero divisors of R. (Of course, 
if R has zero divisors, R,4 will also have zero divisors.) © 


3.11 Polynomial rings: Unique 
factorization and related matters 


In this section, we fix a UFD (unique factorization domain) R that 
is not a field, and we let F' be its quotient field. The most important 
special case of this is when R = Z, in which case F = Q. But the 
argument in general is exactly the same-word for word and symbol 
for symbol-in the general case as it is in this special case, so we may 
as well give it in general. 

We recall that F'[a] is a PID, and hence a UFD. Done! 

Now R[z] is not a PID: If a 4 0 is any nonunit in R, 


I = {p(x) =ap + 4,2 + +++ + anx” | ag is divisible by a} 


is an ideal of R[x] that is generated by {a,x}, but is not generated 
by any single element of R[x], so Riz] is not a PID. 
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Our goal is to show that R[x] is a UFD. We will show this by 
first looking at polynomials in R[z], and then, at the crucial stage, 
by relating the situation in R[x] to the situation in F'[z]. 


Definition 3.11.1. Let f(x) = anx" + --- + ag be a nonzero poly- 
nomial in R[x]. Then f(x) is primitive if its coefficients {ao, ..., an} 
are relatively prime. © 


Lemma 3.11.2 (Gauss’s lemma). If g(x) and h(x) are primitive 
polynomials in Rix], then their product f(x) = g(x)h(x) is a primi- 
tive polynomial in R[x]. 


Proof. We prove this by contradiction. Let 


h(x) = cpa* + -++ +9, 
f(x) =anx” + +++ +49. 


Suppose that f(a) is not primitive. Choose a prime p that divides 
BCd (Gigs. «5.5 Wp): 

Now g(x) is assumed to be primitive, so not all of its coefficients 
are divisible by p. Let 7 be the smallest value such that 6; is not 
divisible by p. 

Similarly, h(x) is assumed to be primitive, so not all of its coefhi- 
cients are divisible by p. Let j7 be the smallest value such that c; is 
not divisible by p. 

Consider the coefficient aj+; of f(x). This coefficient is given by 


i4g = (bitje0 + bi45_-1¢1 + +++ + bj41¢;-1) 


+bjej + (O16 eee Die gy Ope) +e 9): 


Now aj; is assumed to be divisible by p. The “c” coefficients 
in every term in the first parenthesized expression are assumed to 
be divisible by p. The “b” coefficients in every term in the second 
parenthesized expression are assumed to be divisible by p. Hence 
the remaining term, b;c;, must be divisible by p. But p is a prime, 
so that implies that b; is divisible by p or c; is divisible by p; 
contradiction. 
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Definition 3.11.3. Let f(z) = anx" + --- + ap be a nonzero 
polynomial in R[x]. The content c(f(x)) is c(f(x)) = gcd(ao, ..., 
Dads © 


Lemma 3.11.4. Let f(x) € Riz], f(x) # 0 and let d = c(f(a)). 
Then f(x) = (1/d) f(x) is a primitive polynomial in Riz]. 


Proof. If d = gcd(ao, ..., an), let a9 = dap, ..., Gn = da,,. Then 

apy, -..-., a} is a relatively prime set of elements of R, so f(x) = 
0 n 

(1/d) f(a) =a},a" + +++ +a is a primitive polynomial in R[z]. 


Lemma 3.11.5. Let f(x), g(x) € Riz], f(x) £0, g(x) £0. 
Then c(f(x)g(x)) = e(f(@))e(g(«)). 


Proof. Let c = c(f(x)) and d = c(g(x)) and write f(x) = cf 
g(x) = dg(x) as in Lemma 3.11.4. Then h(x) = 
(cd) f(x)g(a). Let e & c(h(x)) and write h(x) = eh(x 


lemma. Then 


— 
8 
YS 


eh(x) = (cd)(f(x)9(2)). 


Now by Gauss’s lemma (Lemma 3.11.2), f(x)g(x) is primitive. 
Thus the right-hand side has content cd, while the left-hand side has 
content e, so we must have e = cd. 


Now suppose h(x) € Rix] and h(x) is a product h(x) = f(x)g(z) 
with f(x), g(x) € Fa]. We would like to conclude f(x), g(x) € R[z]. 
For example, in case R = Z, 2? +2 = x(x +1). But that can’t 
always be right as, again in case R = Z, we also have x7 +2 = 
(2x)(1/2x + 1/2). However, this is the only sort of thing that can go 
wrong. 


Corollary 3.11.6. Let h(x) € Riz] and suppose that h(x) = 
f(x)g(z) with f(x),g(x) € Flax]. Then there are polynomials 
fi(z), g(x) € Ria] with h(x) = fi(x)gi(x), where fi(x) is a con- 
stant multiple of f(x) and gi(x) is a constant multiple of g(x). 


Proof. We may write h(x) = eh(x) with e € R and h(x) € Ri] 
a primitive polynomial. We may write f(x) = cf(x) with c € F and 
f(z) € R[a] a primitive polynomial, and g(x) = d(x) with d € F 
and g(x) € R[x] a primitive polynomial. Then eh(x) = cdf (x)g(x). 
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Again, by Gauss’s lemma, f(x)g(x) is a primitive polynomial in 
R{x], so we must have cd € R. Then, as in the proof of Lemma 3.11.5, 
we have e = cd, i.e., e = ucd for some unit u € R. 

Let fi(z) = f(r) = (/c)f(x) and m(x) = (e/du)g(x) = 
(e/u)g(z). Then fi (x) € Riz], g(x) € R[x] and 


fiteate) = (2) 1@) (E) ala) = (Z) Fleate) 
= f(2)g(v) = h(x) 


as claimed. 


Corollary 3.11.7. Let h(x) be a primitive polynomial in R[x]. Then 
h(x) is irreducible in R[x] if and only if h(x) is irreducible in F\z]. 


Proof. First of all, note that, since h(x) is primitive, h(a) has no 
nonunit constant factors in R[x], so any nonunit factor of h(a) in 
R{x] must be a nonconstant polynomial. 

Now if h(x) is irreducible in F's], i-e., if h(x) cannot be written 
as a product h(x) = f(x)g(x) with f(x), g(x) € Fla] nonunits, i-e., 
nonconstant polynomials, then h(a) certainly cannot be written as 
such a product h(x) = f(x)g(x) with f(x), g(x) € R[x] (as every 
polynomial in R[2] is a polynomial in F'[z]). 

On the other hand, if h(x) is not irreducible in F[z], so that 
h(x) = f(x)g(x) with f(x), g(x) € Fla] nonunits, i-e., nonconstant 
polynomials, then, by Corollary 3.11.6, h(x) = fi(x)gi(x) with fi (2), 
gi(z) € R[x] nonconstant polynomials, and hence nonunits, and so 
h(x) is not irreducible in R[x]. 


Now we arrive at the result to which we have been heading. 


Theorem 3.11.8. Let R be a UFD. Then the ring R[x] of polyno- 
mials in x with coefficients in R is a UFD. 


Proof. Once again we must first show that any nonzero polynomial 
h(x) in R[a] has a factorization into irreducibles, and then show that 
such a factorization is essentially unique. 

For the first step: Write h(x) = eh(x) where e = c(h(x)). Now 
e € R so e has a factorization into irreducibles (i.e., primes) e = 
up ... py with uy a unit and pi, ..., py primes. Then h(x) € F[z] 
and we know that F'[z] is a UFD, ie., h(x) has a factorization into 
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irreducible in Fz]. But putting Corollary 3.11.6 and Corollary 3.11.7 
together, we see that h(x) has a factorization into irreducibles in R[z}, 
1.63; 
h(x) = u2fi(x) ... fm(a) with each f;(x) € R[x] irreducible 
and ug a unit in R[x], ie., a unit in R 
and then, if vu = ujue, a unit in R, 


h(x) = up. ... pefi(z) ... f(x) 


is a factorization of h(x) into irreducibles in R{z]. 
For the second step: Suppose we have two factorizations 


A(z) = upy ..- Pefi(Z) --- fm(2)- 


and 


A(x) = vq .-- qgi(@) --- Gn(z). 


First notice that, since h(x) is primitive, each f;(x) is primi- 
tive, and so p; ... p, = c(h(a)). Similarly, each g;(x) is primitive, 
so q....q & c(h(x)). Hence p, ... py = wq ... q for some unit 
w € R. But by unique factorization in R, we must have | = k, and, 
after possible reordering, p; = q; fori =1,..., k. 

But these are two factorizations of h(x) in R[x], hence in Fz]. 
Since each factor f;(x) is irreducible in R{z], it is irreducible in F [x 
(Corollary 3.11.7) and similarly, since each factor g;(x) is irreducible 
in R[x], it is irreducible in F'[z]. But we have unique factorization in 
Fz], som =n, and, after possible reordering, f(x) = gi(x) in Fix 
fori=1,..., m,ie., fi(x) = uig;(x) for some unit u; € F’. But f(z) 
and g;(z) are both primitive polynomials in R[x], so u; is a unit in 
R, in which case f;(x) = g;(x) in R[x], and we are done. 


Corollary 3.11.9. Let R be a UFD. 


(a). For any n, Rlei,...., %,| is a UPD. 
(b) For any set of variables {x;}, R[{x;}] 1s a UFD. 


Proof. 


(a) Since Alyiysc5%,| = Ctl; ...5 e-i))le_|, this follows 
directly from Theorem 3.11.18 by induction. 
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(b) Consider any polynomial in R[{z;}]. This can only involve 
finitely many variables, so must be in R[x, ..., 2,] for some 
n. Also, any factor of it must be in R[x, ..., 2]. So, by part 
(a), this polynomial has an essentially unique factorization in 


Ri{ai}]- 


Remark 3.11.10. If R is a UFD that is not a field, and {;} has at 
least one (and possibly infinitely many) elements, then R[{x;}] is a 
UFD that is not a PID. 

If R is a field, and {x;} has at least two (and possibly infinitely 
many) elements, then R[{z;}] is a UFD that is not a PID. 0) 


Remark 3.11.11. If R is Noetherian, and {x;} is finite, then R[{;}] 
is Noetherian. This is the Hilbert basis theorem, Theorem 3.3.13. 
If R is not Noetherian, the R[{x;}] is certainly not Noetherian. 
If R is Noetherian, and {2;} is infinite, then R[{x;}] is not Noethe- 
rian. This is Example 3.2.17. 


Now we turn to practical questions about factorization of poly- 
nomials in R[x]. First we can ask when such a polynomial has a root 
in R, or in F’. This question is easy to answer. 


Lemma 3.11.12. Let R be a UFD. 
(a) Let f(x) € Rix] be a monic polynomial, 


f(z) =a" + an_iz™ 1+ --- +4. 


Then any root of f(x) in F must be an element s of R. Further- 
more, s must divide ag in R. 
(b) Let f(x) € Ria] be arbitrary, 


f(x) =anx” + Oe ae st HG, 


Then any root s/t of f(x) in F with s/t in lowest terms (i.e., 
s€R,te Rands andt relatively prime) must have s dividing 
ag in R and t dividing ay in R. 


Proof. Note that (a) is a special case of (b). If (b) is true, and 
f(x) is monic, then t must divide 1 in R, ie., t is a unit in R, and 
so s/t is an element of R, and s/t divides ag if and only if s divides 
ag. Thus, we need only prove (b). 
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Let f(x) have root s/t with s and t relatively prime. Substituting, 
0 = an(s/t)” +an_1(s/t)”- +--+ +a1(s/t)+a9. Multiplying through 
by t”, 


=a 4 ae Ot ee et se opt” 


Now the left-hand side, and every term on the right-hand side, 
except possibly the last, is divisible by s. So the last term, aot”, 
must be divisible by s as well. We are assuming that s and t are 
relatively prime, so s and t” are relatively prime as well, and so s must 
divide ag. 

Similarly, by looking at the first term on the right-hand side, we 
see that t must divide ap. 


Lemma 3.11.12 was first thought of in the case R = Z, F = Q 
(long before people thought about UFD’s in general) so it is often 
called the rational root test. 

The second question we can ask is when a polynomial in R[z] is 
irreducible. In general, this is a difficult question to answer. But we 
do have the following very useful criterion. 


Lemma 3.11.13 (Eisenstein’s criterion). Let R be a UFD. Let 
h(x) = dna" + +++ + a9 € R[x] be an arbitrary polynomial. Suppose 
there is some prime p in R such that: 

(i) p does not divide an, 

(ii) p divides an_1, ..-, Qo, 
(iii) p? does not divide ao. 


Write h(x) = c(h(x))h(x), so that h(x) is a primitive polynomial 
in Ria]. Then h(x) is irreducible in Fx] (and hence in R{[z]). 
In particular, if h(x) is a primitive polynomial in R[x] then h(x) 
is irreducible in F|a] (and hence in R[z]). 
Proof. Note, by Corollary 3.11.7, that h() is irreducible in Fz] 
if and only if it is irreducible in R[z]. 
We prove this theorem by contradiction. Let 
(2) = Dye + +p 
gle) = ca" +--+ 


with m,k > 1 and suppose h(x) = f(x)g(x). We may assume 
f(x), g(x) € Riz}. 
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Observe that a9 = boco. Then, from conditions (ii) and (iii), we 
see that p divides exactly one of bg and co. Suppose that p divides bo 
but not co. 


We claim that p divides bo, ..., bm. We prove this by induction. 
In case 7 = 0, we are assuming p divides bg. 
Now suppose p divides bo, ..., b;-1 and consider b;. We see that 


ay = boc; + bicgy_-1 + +++ + Bj-1¢1 + Bicep. 


Now p divides a; (by condition (ii)) and, by the inductive hypoth- 
esis, p divides every term on the right-hand side except for possibly 
the last one, so p must divide the last term b;co as well. But p does 
not divide co, so p must divide 0;. 

Now note that an = bmcp, so, since p divides b,,, p must divide 
dn. But this contradicts condition (i). 7 

Thus, it is impossible to factor h(x), i.e., h(x) is irreducible. 


Example 3.11.14. Let p be a prime and let n be any positive inte- 
ger. Then h(x) = x” — p is an irreducible polynomial of degree n in 


Z[x] (or Q(z). 0 


Remark 3.11.15. The integers Z are a UFD, and the polynomial 
rings Z[z] and Q|z] are UFDs. In a UFD, as we know, primes and 
irreducibles are the same. Nevertheless, it is common to refer to inte- 
gers as prime and polynomials as irreducible. This is an accident of 
mathematical history. © 


3.12 Ideals: Maximal and prime 


We now let R be an arbitrary commutative ring with 1. We want to 
consider two kinds of ideals in R. 


Definition 3.12.1. An ideal J of Ris maximal if I ~ R and there is 
no ideal J of R with Ic J C R. Equivalently, J is maximal if JC R 
and if J is an ideal of R such that JC J, then J=JorJ=R. 9 


Definition 3.12.2. An ideal IJ of R is prime if whenever a,b € R 
with ab € 1, thena€ J or bel. © 
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These two notions are closely related. 


Lemma 3.12.3. Let I be an ideal of R. If I is maximal, then I is 
prime. 


Proof. Let J be a maximal ideal of R. Suppose a,b € R with 
ab € I. We need to show that a € I or bE T. 

If a € I we are done. Suppose not. Let J be the ideal generated 
by a and I. Concretely, J = {ra+i|reR,ie I}. NowI Cc J, so, 
since J is maximal, we must have J = R. In particular, J € J, so 
1=roa+io for some r € R, ig € J. But then 


b=b-1 = b(roa + ig) = ro(ab) + bio 


But ab € J by assumption, and ip € J. Thus b € J, as required. 


As we shall see, the converse of this result is sometimes, but not 
always, true. But first, an easy observation. 


Lemma 3.12.4. The ideal I = {0} of R is a prime ideal if and only 
if R is an integral domain. 


Proof. By definition R is an integral domain if it has no zero divi- 
sors, i.e., if a and b are elements of R with ab = 0, then a = 0 or 
b=0. 


We have used the word “prime” before, in connection with ele- 
ments of an integral domain. We are using it again, in connection 
with ideals. Since we are using the same word in two different. con- 
texts, we would expect these to be a close relation between the two, 
and there is. 


Lemma 3.12.5. Let R be an integral domain and letr € R, r #0. 
Then the principal ideal I generated by r is a prime ideal if and only 
ifr is a prime in R. 


Proof. Note that J # Rif and only ifr is not a unit. But J consists 
exactly of the multiples of r, so a € J if and only if r divides a, and 
b € I if and only if r divides b, and ab € I if and only if r divides ab, 
and so we see the two conditions are equivalent. 


Example 3.12.6. Let R = Z{z]. As we have seen, 2 and x are both 
primes in R, so if J, is the (principal) ideal generated by 2, and 


242 An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


In is the (principal) ideal generated by x, then J; and J, are both 
prime ideals in R. But they are not maximal ideals, as they are both 
contained in the ideal J generated by {2,a}. We have already seen 
that J = {polynomials a,x” + --- + ao in Z[z] with ag even}, so J 
is indeed a proper ideal in R. Now J is in fact a maximal ideal in 
R, as we now see: Let f(x) be any polynomial not in J, i.e., any 
polynomial with odd constant term. Then it is easy to check that we 
can write 1 = f(x)g(x) + h(x) for some polynomial g(x) € R and 
some polynomial h(x) € I. But then if Kk is the ideal generated by 
f(x) and J, we have K = R. ©) 


Thus, in general not every prime ideal is maximal. But in one 
important case it is. 


Lemma 3.12.7. Let R be a principal ideal domain and let I be a 
nonzero prime ideal of R. Then I is maximal. 


Proof. In this case, by Lemma 3.12.5, J is generated by a prime 
pe R. Leta e€ R,a € I. Let J be the ideal generated by a and J. 
We need to show that J = R. Let d = gcd(a,p). Now p is a prime, 
and p does not divide a (as a ¢ I), so a and p are relatively prime, 
ie., d= 1. But R is a PID, so we know we can write 


1 =ab+ pq for some b,qg € R. 


But that implies 1 € J and hence J = R. 


We now return to the situation of a general commutative ring 
with 1. 


Theorem 3.12.8. Let R be a commutative ring with 1 and let I be 
an ideal of R. 


(a) I is a prime ideal if and only if the quotient R/I is an integral 
domain. 
(b) I is a maximal ideal if and only if the quotient R/I is a field. 


Proof. Let 7 be the quotient map 7: R > R/I, so x(a) =a+I. 
For simplicity, we will write 7(a) = [a]. Observe that [a] = 0 if and 
only if a € I. 


(a) Suppose that I is a prime ideal and let [a], [b] € R/J with [a][}] 
0. We need to show [a] = 0 or [b] = 0. Now [a][b] = [abd], 


SO 


Ring Theory 243 


0 = [a][b] = [ab] which is true if and only if ab € I. But Iisa 
prime ideal, so that implies a € I, in which case [a] = 0, or b € J, 
in which case [b] = 0. 

On the other hand, suppose J is not a prime ideal. Let a,b € R 
with a ¢ I, b ¢ I, but ab € I. Then [a] # 0, [b] 4 0, but 
[a][b] = [ab] = 0. 

(b) Suppose that J is a maximal ideal and let [a] © R/I, [a] # 0. 
We need to show that [a] has an inverse [b] in R/I. Nwae R 
with a ¢ I (as [a] 4 0) and J is maximal, so, as we have seen 
in the proof of Lemma 3.12.3, there is an element b of R and an 
element i of J with ab+i%=1. But then [a][b] = [1] in R/J, and 
[1] =1+ TJ is the identity element of R/TI. 

On the other hand, if J is not a maximal ideal, let J be an ideal 
of RwithI CJ CR. Leta ce J,a ¢ I. We claim [a] € R/I 
does not have an inverse. Suppose it did, [a][b] = [1] for some 
[b] € R/I. Then ab € 1+T7, ie., ab = 1+ for some element i 
of I, and then 1 = ab+ (-1). Nowa e€ J, —i € I and I C J, so 
—1€J; hence 1 € J and J = R, a contradiction. 


We will be using part (b) of Theorem 3.12.8 extensively, as it is 
a very effective method of constructing fields. 


Example 3.12.9. Let R = F [a] with F a field. Then R is a PID. 
Let p(x) € R be an irreducible polynomial, i.e., a prime in R. Let I 
be the ideal generated by p(x). Then J is a nonzero prime ideal in 
the PID R, and so, by Lemma 3.12.7, a maximal ideal in R. Then 
R/T is a field. .) 


We have seen that Z[z] is an example of a UFD that is not a 
PID, and also an example of a ring in which not every nonzero prime 
ideal is maximal. It is no coincidence that we used the same example 
twice! 


Theorem 3.12.10. Let R be an integral domain. The following are 


equivalent: 


CL) asa PID, 
(2) R is a UFD and every nonzero prime ideal of R is maximal. 


Proof. We have already shown that if R is a PID, then R is a 
UFD (Theorem 3.6.13), and also that every nonzero prime ideal of 
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R is maximal (Lemma 3.12.7), so we know that condition (1) implies 
condition (2). 

We must show that condition (2) implies condition (1). This is a 
long proof, and we will break it up into several steps. 

Let R be a UFD in which every nonzero prime ideal is maximal. 
Let I be an ideal of R. We must show J is principal. This is certainly 
true if J = {0}, so assume J is nonzero. 

Step 1: Since R is a UFD, it is a GCD domain (Lemma 3.6.17). 
Let g = gcd({ elements of [}). Write g = up}! ... pi* with u a unit 
and p, ..., Dz distinct primes. From Lemma 3.6.16 we see that there 
must be an element a; of J such that the highest power of p; dividing 
a, is pj’, an element ag of I such that the highest power of p2 dividing 


ag is ps’, ..., an element a, of J such that the highest power of pz 
dividing a;, is p,*. 
Then we see that g = gcd(a1, ..., ay). The point here is that even 


if the ideal J is not finitely generated, g is a gcd of a finite number of 
elements of J. (It may be that some of the a,;’s coincide, or that some 
a; divide another, in which case this set is redundant. We could be 
“neat” about it and discard the redundant elements, but we don’t 
have to-with or without redundancies, the point is that this set is 
finite.) 

Let J be the ideal of R generated by g. Since g divides every ele- 
ment of J, J C J. We will show that g € J, in which case J CJ. Thus, 
I = J, a principal ideal (as J is generated by the single element g). 

Let A = {a,, ..., ax}. If A consists only of a single element {aj}, 
then g = a, so in this case certainly J = J. 

The crucial case is when A has two elements, A = {a1, a2}, which 
we rename {a,b} for clarity. 

Step 2: Let A = {a,b}, and g = gcd(a,b). We claim g € I. 

Step 2a: Suppose a is a prime p. If p divides b, then g = a and 
once again J = J. 

Suppose that a does not divide b. Since we are assuming that a is 
a prime, that means that a and 0 are relatively prime, and so 1 = g, 
and hence J = R. Let Ip be the ideal generated by a and I, be the 
ideal generated by A = {a,b}. Since a does not divide b, Ip C I1, and 
since a,b € I, I, C I. Now a is a prime, so Jp is a nonzero prime 
ideal, and we are assuming that every nonzero prime ideal of R is 
maximal. Thus J; = R, and so J = R, and so I = J. We observe for 
future use that in this case, 1 = ar + bs for some r,s € R. 
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Step 2b: Suppose that a and 0 are relatively prime. Again in this 


~N fi fj 


case, 1 = g and hence J = R. Write a = qy' ... g;’ where q, ..., 9; 
are distinct primes (some subset of pi, ... pz). Since a and 6b are 
relatively prime, qg; and 6 are relatively for each 7. Thus, by step 2a, 


we have 
1 = qr; + bs; for some r;, 5; € R, 
for each i =1,..., 7. But then 
J 
1= [|r + bs;)f 
i=1 
Now observe that in this product, there is one term that has 
qi ‘ .. gf) © a asa factor, and every other term has b as a factor. 
Thus, we see that in this case, 1 = ar + bs for some r,s € R, so once 
again J = R, and so I = J. 
Step 2c: Let a and 6 be arbitrary. By considering common prime 
factors of a and b, we see we can write 


a= qi .. gir 


P= G7 sas gs 


where r and s are relatively prime, r and q' ... qf’ are rela- 
tively prime, and s and qi aie qi are relatively prime. Let ey = 
min(f1,91), ---, ee = min(f:, 9). Then g = qi' ... q;’. Again let Ip 
be the ideal generated by a and b. 
Now set 
a’ =qi% .,, gf" 
Cag us, 


and let I} be the ideal generated by a’ and b’. We observe that Jp = 
{gi |i € B}. 
Now a’ = a/g, b' = b/g, so a’ and B' are relatively prime. Hence, 
by step 2b, 1 = a'r + b's for some 71s € R, and then 
g=g9-1=g(a'r +0's) = (ga’)r + (gb’')s =ar+bs 


so g € I, and hence I = J. 
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Step 3: Let A = {a1, ..., ax}. We argue by induction on k. We 
saw that in case k = 1 there was nothing to prove, and the case 
k = 2 was step 2. Now assume that any ideal J for which the set A 
consists of k—1 elements is principal, and suppose that A consists of k 


elements. Recall that ged(a1, ..., az) = ged(ged(a1, a2), a3, ..., ax). 
By the k = 2 case, if gig © gced(aj,a2), then gig € I, so we may 
replace A by A’ = {g12,a3, ..-, @x}, a set of k—1 elements of I with 


a same gcd, g, so by the k —1 case g € I, J = J, and by induction 
we are finally done. 


3.13. Exercises 


1. Let R be aring. For a set X, let 
RX = {f:X + R} 
Define addition and multiplication in R* by (f + g)(x) = 
f(x) + g(x) and (fg)(x) = f(x)g(x). Show that R* with these 
operations is a ring. 
2. (a) Let X be a set and let P(X) be the set of subsets of X. 
Define addition and multiplication on P(X) by 


A+ B=(ANB®)U(A®% UB), the symmetric 
difference ofA and B, 
AB=ANB, the intersection of A and B. 


Show that P(X) with these operations is a ring. 

(b) Let R = Zy. Show that P(X) is isomorphic to R* as defined 
above. 

3. A Boolean ring is a ring B such that b? = b for every b € B. (For 

example, P(X) is a Boolean ring.) 

Let B be a Boolean ring. 

(a) Show that B is commutative. 

(b) Show that 2b = 0 for every b € B. 

(c) Let B be finite. For a nonempty subset A of B, let 


ba = II b. 
be A 


Let e = 4b, where the sum is taken over all nonempty 
subsets A of B. Show that eb = be = 6 for every b € B. 
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(Thus, a nontrivial finite Boolean ring is automatically a 
ring with 1.) 

(d) Give an example of an infinite Boolean ring that is a ring 
with 1, and one that is not. 


. Let R be an arbitrary ring. Let S = R x Z with the following 
operations 


(a,m) - (b, 7) = (a+b,m-+n) 
(a,m) - (b,n) = (ab + na+mb,mn). 


Show that S is a ring with 1. Note that R is isomorphic to the 
subring Ro = {(r,0)} of S. (However, even if R is a ring with 1, 
Ro is not a subring- with-1 of S.) 

. (a) Show that O(VD), as defined in Example 3.1.16, is a ring. 

(b) Show that O(VD) is an integral domain. 

. Prove Lemma 3.2.10. 

. We proved the Noether isomorphism theorems for groups. Prove 
the analogous theorems for rings: 


(a) (First isomorphism theorem) Let y: R > S be a ring homo- 
morphism. Then Im(y) is isomorphic to R/Ker(y). 

(b) (Second isomorphism theorem) Let J and J be ideals in a 
ring R. Then I/IQ J is isomorphic to (J + J)/J. 

(c) (Third isomorphism theorem) Let J and J be ideals in a ring 
R with J CI. Then R/I is isomorphic to (R/J)/(I/J). 

(d) (Correspondence theorem) Let J be an ideal in a ring R. 
Then there is a 1—1 correspondence between {ideals of R 
containing J} and {ideals of R/J} given by I > I/J. 


. Let R be an integral domain that has the structure of a finite 
dimensional vector space over some field F. Show that R is a 
field. 

. Let F be a field and let a and 6 be fixed elements of F. Let 


10 0 —b 
be | and c=| | 
Ol 1-a 


Let R= {sl +tC | s,teF}. 
(Note that R is a 2-dimensional F-vector space. We may consider 
F C R by identifying s € F with sI € R.) 


(a) Show that R is a commutative ring with 1. 
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(b) If the quadratic polynomial p(x) = ax? + bx + € does not 
have a root in F, show that R is a field. 
(c) Show that C is a root of p(x) in R, ie., that p(C) = 0. 


10. The center C(R) of a ring R is 
C(R) ={réR|rs=sr for all s € R}, 


ie., the center of R is the subring of R consisting of those ele- 
ments of R that commute with every element of R. 
Let A be an arbitrary commutative ring with 1, and _ let 
R = M,(A) be the ring of n-by-n matrices with entries 
in A, with the usual matrix operations. Show that C(R) = 
{scalar matrices in M,,(A)}. 

11. Let G be an arbitrary group and let R be a commutative ring 
with 1. The group ring of G with coefficients in R is 


R|G] = s rgg | only finite many r, 4 0 
geG 


with operations 


» rgg + oy SgJ = Sor + 89)g 


geG geG geG 
Yo 709 (= vt) = SO (resn)gh. 
g€G heG g,heG 


(a) The augmentation ideal I of R[G] is the kernel of the aug- 
mentation map e: RIG] > R given by e(Urgg) = Urg. 
Show that J is generated as an abelian group by {g — 1| 
g EG}. 

(b) Let {C;} be the sets of conjugacy classes of elements of G 
that contains only finitely many elements, and for each such 
conjugacy class Cj, let ¢; = Ugeo,g. Show that the center 
C(R[G]) is generated as an abelian group by {c;}. 


12. Let R be a ring with 1. 


(a) Show that R has a nontrivial proper left ideal if and only if 
R has a nontrivial proper right ideal. 


13. 


14. 


15. 


16. 
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(b) Show that R is a skew field if and only if the only left ideals 
of R are {0} and R, or, equivalently (by part (a)) if and only 
if the only right ideals of R are {0} and R. (If R is a skew 
field, the only two-sided ideals of R are {0} and R, but the 
converse of this statement is false, as we see from the next 
problem.) 


A ring is called simple if it has no nontrivial proper two-sided 
ideals. 


(a) Let F be a field and let R = M,,(F). Show that R is a simple 
ring. 

(b) More generally, let A be a commutative ring with 1 and let 
R= M,(A). Show that every two-sided ideal of R is given 
by R = M,(J) for some ideal I of A. 


(a) Let R bea ring with 1 and let r and s be elements of R with 
rs = 1. Of course, if sr = 1 then r and s are units, s = r~! 
and r = s~!. Suppose that sr 4 1. Show that neither r, s, 
nor sr are units. 

(b) Give an example of this situation. 

Let R be aring. An element r € R, r 4 0, is nilpotent if r* = 0 

for some k > 0. 


(a) If R is commutative, show that 
{nilpotent elements of R}U {0} 


is an ideal of R. 
(b) Give an example to show that (a) may be false if R is not 
commutative. 
An idempotent in a ring R with 1 is an element e 4 0 or 1 with 
2 
e* =e. 


(a) If e is an idempotent of R show that f = 1 —e is also an 
idempotent of R with ef = fe =0. 


(b) A set {e1, ..., en} of elements of R is a complementary set 
of idempotents of R if e? = e; for each i and ee; = eje; = 0 
for i # j. If {e1, ..., En} is such a set, show that 


R= Re, @... @ Rey as left ideals, 
R=e,R@... Ge, as right ideals. 
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(c) An idempotent e of R is central if e € C(R), the center of R. 
If e is a central idempotent of R, show that eR = Re = eRe 
is a subring of R, and is a ring with 1. If {e1,..., en} isa 
complementary set of central idempotents of R, show that 


R=e,Re, 6... Gen Rep as rings. 


17. (a) Let G be a cyclic group of order n, which we write multi- 
plicatively as {1,9, ..., g” 1}. Let R = C[G] be the complex 
group ring of G. Set ¢ = exp(27i/n) € C. Let 


= 

ki i 
=-) tor k= 0,.«+- —1. 
€k n2ue 9 or ’ » 


Show that {e9, ..., €n—1} is a complementary set of central 
idempotents of C[G]. 

(b) Let n be odd and let G be the dihedral group G = 
Do, of order 2n. Following our previous notation, G = 
{1,a,...,a"-1, B,a8,...,a”- 18}. Let H be the subgroup 
H ={l,a,..., a” *} of G. Set ¢ = exp(27i/n) € C. Let 


== tg for R= 1.004 (e— 172. 


gcH 


Show that {eo,¢1, f1,---; f(n—1)/2} is a complementary set of 
central idempotents in C[G]. (There is a similar, but slightly 
more complicated, formula in case n is even.) 

18. Let R = Z. In each case, find d = ged(a,b), and express d in the 
form d = argo + bsg for some ro, so in R. 


(a) a= 25,b= 18 
(b) a = 1223,b = 541 
(c) a = 12599, b = 8557 


19. 


20. 


21. 


22. 


23. 


24. 


25. 
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(d) a = 126749, b = 28805 
(ec) a = 1079909, b = 404689 


Let R = Z{i]. In each case, find d = gcd(a,b), and express d in 
the form d = arg + bso for some ro, 89 in R. 


(a) a=8+13i,b = 10+ 17i 
(b) a=1+431i,b =16 + 41i 
(c) a=14+1T7i,b = 26 — 31 


Let R = Q(z]. In each case, find d = gcd(a,b), and express d in 
the form d = aro + bso for some ro, S89 in R. 

(a) a=a++23 4 3a? + 4r + 2, b=234+2r+1 

(b) a= a2* + 2a3 — x? + Qe — 2, b=2°4+ 307 +243 

Let R be a PID. Let a,n € R with n £0, and let d = gcd(a,n). 
Show that the congruence az = b (mod n) has a solution if and 
only if b is divisible by d. In that case, show that the solution is 
unique (mod n/d). (In particular, if a@ and n are relatively prime, 
the congruence az = b (mod n) has a unique solution (mod n) 
for any 0.) 

(b) In case R = Z, suppose that 6 is divisible by d. Show that 
the congruence az = b (mod n) has exactly d solutions (mod n). 
Let R = Z. Consider each of the following congruences (mod 7). 
Find all solutions (if any) (mod n). 


(a) 162 = 9 (mod 25) 
(b) 182 = 3 (mod 47) 
(c) 95a = 21 (mod 683) 
(d) 352 = 77 (mod 140) 


(e) 652 = 91 (mod 120) 


Let R be a PID, let a,b € R and let d = gcd(a,b). Write a = da’ 
and b = db’. Suppose that d = arg + bso. Show that 


r=rot+t’=, s=so—ta,teR 


are solutions to d = ar + bs, and furthermore that all solutions 
to d=ar + bs are of this form. 

Let R = Z. Find all solutions to gcd(693, 819, 1001) = 693” + 
819y + 1001z with x,y,z in R. 

Solve each of the following systems of simultaneous congruences 
in Z. 
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26. 


27. 


28. 
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(a) x = 19 (mod 32) 

x= 9 (mod 45) 
(b) 2 = 87 (mod 163) 

x = 56 (mod 257) 
(c) x = 5 (mod 9) 

x = 7 (mod 19) 

x = 9 (mod 29) 


(d) 2 = 10 (mod 47) 
x = 17 (mod 55) 
x = 25 (mod 91) 

(e)  =3 (mod 5) 


x = 4 (mod 7) 
x = 5 (mod 11) 
x = 6 (mod 13) 


Solve the following system of simultaneous congruences 
in Zi]: 
v=3-i (mod 3 + 72) 


v=2+i (mod 4 + 5i). 


Let n be a positive integer. Show that the following are 
equivalent: 


(a) For every integer a, there is some positive integer t with 
ate(n)+1 = q (mod n). 

(b) For every integer a, and every positive integer t, at?(™+1 = 
a (mod n). 

(c) n is a product of distinct primes. 


(a) An integer is squarefree if it is not divisible by any per- 
fect square except 1. Show that for any integer k, there is a 
sequence of & consecutive integers none of which is square- 
free. 

(b) Call an integer n exactly divisible by a prime p if p divides 
n but p? does not. Show that for any integer k, there is a 
sequence of k consecutive positive integers 71, ..., 7, anda 
sequence of primes p), ..., py such that p; exactly divides x; 
but p; does not divide x; for 7 #1, for eachi=1,..., k. 


29. 


30. 


ol. 


32. 


33. 


34. 


35. 
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Prove the following generalization of the Chinese remainder 
theorem: 


Theorem. Let R be a commutative ring with 1, and let I, and 
Ig be two ideals of R. Let ry; and rg be two elements of R. Then 
the system of simultaneous congruences 


x =r; (mod l;) 


x =rg (mod I2) 


has a solution if and only if 7; = rg (mod i; + Jy). 

If that is the case, and x = b is any solution, then z = Db! is a 

solution if and only if b' = b (mod I, I). 

Let F be a field and let R = F[z]. Let a and b be positive integers. 

Show that ged(x* — 1, 2° — 1) = a9e4ab) — 1, 

Let R be a PID. 

(a) If a = b (mod n1) and a = 6b (mod ng), show that a = 
b (mod lem(n1, 72)). (In particular, if n; and nz are relatively 
prime, a = b (mod nyng).) 

(b) If a = b (mod n1) and a = c (mod ng), show that b = 
c (mod ged(n1,n2)). (In particular, ifn; and nz are relatively 
prime, this gives no information.) 


Let p(x) be any polynomial with integer coefficients. Show that 


for every positive integer n. 

Let R be an f-GCD domain and let a, b,c,d € R with gcd(a,c) = 

gcd(a, d) = gced(b, c) = ged(b, d) = 1. Show that gcd(ab, cd) = 1. 

(a) Let R be an f-GCD domain. Let a,b € R and suppose 
that a’ divides b* for some positive integer k. Show that 
a divides b. 

(b) Give an example of an integral domain R and elements a,b 
of R with a* dividing b* for some k > 0 but a not dividing b. 

(a) Let R be an f-GCD domain and let F be its quotient field. 
Deduce from the previous exercise that if r € R, and r = s* 
for some s € F, then s € R. (In other words, r is a k-th 
power in F if and only if it is a k-th power in R.) 
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36. 


37. 


38. 


39. 
AO. 


Al. 


An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


(b) Give an example of an integral domain R with quotient field 
F, and an element a of R that is a k-th power in F, but not 
in R. 

Let R be an f-GCD domain. Let a,b € R and suppose that ab 

is a k-th power in R, i.e., ab = c* for some c € R. If a and b are 

relatively prime, show that there are elements e and f of R such 

that a is an associate of e” and b is an associate of f*. 

Let F be a field and let R = F[z]. Let p(x) € R with the property 

that p(f) = 0 for every f € F. 


(a) If F is infinite, show that p(x) is the 0 polynomial. 
(b) If F is finite, give an example of a nonzero polynomial with 
this property. 


(a) Let R be a UFD that is not field. Suppose that R has only 
finitely many units. Show that if p(x) € Ria] is any noncon- 
stant polynomial, then p(r) is composite for infinitely many 
rer. 

(b) Give an example of a UFD R, that is not a field, and a 
nonconstant polynomial p(x) € Ria] such that for every r € 
R, p(r) is either 0, a unit, or a prime. 

Prove Lemma 3.5.7. 

(a) Let R be a UFD. Prove that two elements a and b of R are 
relatively prime if and only if they have no common prime 
factor. 

(b) Prove Lemma 3.6.16 and Lemma 3.6.17. 

Let R be an integral domain and let a and b be nonzero elements 

of R. We say that alb®© if a|bN for some N. Write a ~ b if a|b%° 

and bla®. 

(a) Show that ~ is an equivalence relation on R — {0}. 

(b) Let R be an f-GCD domain. If a|b® and c is relatively prime 
to b, show that c is relatively prime to a. Conclude that if 
a ~ b, then c is relatively prime to a if and only if c is 
relatively prime to b. 

(c) Suppose that R is a UFD. Let a and b be nonzero elements 
of R. Show that we can write a = daa, b = d,b with da ~ dp, 
a and 0 relatively prime and @ and 6 each relatively prime 
to both dg and dj. In the notation of Lemma 3.6.16, identify 
dq, a, dp, and b. 


42. 


43. 


4A, 


45. 
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Let R be a UFD. Let a, b, c € R with c ¥ 0, and suppose 
that gced(a,b,c) = 1. Show that there is some d € R such that 
gcd(a + bd,c) = 1. 

Let R be an integral domain. Analogous to the notion of great- 
est common divisor (gcd) we have the notion of least common 
multiple (lcm). 


Definition. Let {a;} be a finite set of nonzero elements in an 

integral domain R. Then m is ar least common multiple (lem) of 

{a;} if 

(1) a; divides m for each 7; and 

(2) ifn is any element of R such that each a; divides n, then m 
divides n. 


Similarly to the situation with the gcd, if {a;} has a least common 
multiple m, then m’ € R is an lem of {a;} if and only if m and 
m’ are associates. 


(a) Let R be a UFD. In the situation, and notation, of Lemma 
3.6.16, let m; = max(fi,91),...,mj; = max(fj,g;). Show 
that 


lemma, bp ose PPO, xs ai ree, 

(b) More generally, let R be an f-GCD domain. Let a and b be 
nonzero elements of R. Show that a and b have an lcm, and 
moreover that lcm(a, b) = ab/ged(a, b). 

Let R be an f-GCD domain and let a and b be nonzero elements 

of R. 


(a) Show that for any nonzero element c of R, 
Icm(ca, cb) = c- lem(a, b). 


(b) Let ¢ be any nonzero element of R that divides both a and 
b. Show that lem(a/c, b/c) = lem(a, b)/c. 

(Compare Lemma 3.5.24). 

Let R be an f-GCD domain and let B and C be finite sets 

of nonzero elements of R, each of which has an lcm. Let 

A = BUC. Show that A has an lem, and moreover that 

Icm(A) = Iem(Iem(B), lem(C)). (Compare Lemma 3.5.21.) Note 
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that, by induction on the number of elements of A, this shows 

that any finite set of nonzero elements in an f-GCD domain has 

an lcm. 

Let {a1,..., @,} be a finite set of nonzero elements in an f- 

GCD domain R. Let A be the product A = aj, ..., Gyn. Show 

that lem(a1, ..., @n) = A/ged(A/ai, ..., A/an). 

Derive the method of “partial fractions” as follows: 

(a) Let R be a PID with quotient field F. Let s € F, s ¢ R, 
and write s in the form s = a/b, where b has prime factor- 


aon Dp" ax. ee with pi, ..., pe distinct primes and 
€1,-.., Ck positive integers, and a and 6 relatively prime. 
Show that 

ay ak 

Sa bt ee 

Py Pr 
for some elements aj, ..., ax of R with a; and p; relatively 
prime, for each i = 1,..., k. 


(b) Now suppose that R is a Euclidean domain with Euclidean 
norm 6. Let p be a prime in R and let a be an element of 
R that is relatively prime to p. Let e be a positive integer. 
Show that 


ay de 

p Pp 
for some elements ag, a1, ... Ge of R, with aj; = 0 or d(a;) < 
d(p) for each 7 = 1, ..., ¢. 


(c) Assembling (a) and (b), let R be a Euclidean domain with 
Euclidean norm 6, let F be the quotient field of R, and let 
s€F. Ifs ¢ R, write s =a/b as in (a). Conclude that there 


is an element co of R, and for each i = 1,..., & there are 
elements c7 of 2,9 = 1,52), with Gy = 0-or O(G7) < 
6(p;), such that 
k ey 
Cc 
ssat yy 
i=1 j=1 Pi 


(If s € R, we just have the expression s = cg with cp = 5 € 
RS 


. (a) Find a partial fraction decomposition of 4321/6000 in Q. 
) 


Find a partial fraction decomposition of (x* + 1)/((a + 
1)?(a3 — x + 2)) in Q{z]. 


AQ. 


50. 


51. 


52. 


53. 


54. 


55. 
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a1 


A vector vEZ", v=] : |, is primitive if gcd(ai,..., @n)=1. 


an 

Show that the group GL,(Z) acts transitively on 

{primitive vectors in Z"}. 

(Observe that this is equivalent to the claim that for any prim- 

itive vector v € Z”, there is a matrix A € GL,(Z) whose first 

column is v.) 

(a) Let p be a prime. Show that GL,,(Z,) acts transitively on 
{nonzero vectors in Z?}. 

(b) Use (a) and an inductive argument to determine |GL,,(Z,)|. 

Let p be a prime congruent to 1 (mod 4), and let a and b be 

integers with a? +b? = p. Find an explicit isomorphism y: Z,) > 

Z|t|/I where I =< a+bi > is the principal ideal of Zi] generated 

by a+ bi. 

(a) Show that 6(z) = |z2| is a multiplicative Euclidean norm 
on O(,/—2). Conclude that O(./—2) is a Euclidean domain, 
and hence a PID. 

(b) Show that 6(z) = |z2| is a multiplicative Euclidean norm on 
O(/2). Conclude that O(/2) is a Euclidean domain, and 
hence a PID. 

(a) Show that any prime p that is congruent to 1 or 3 (mod _ 8) 
can be written as p = a? + 2b? for some integers a and b. 

(b) Show that any prime p that is congruent to 1 or 7 (mod 8) 
can be written as p = a? — 2b? for some integers a and b. 

Let U(D) be the group of units in O(VD). 

(a) Show that U(—1) = {+1, +i}. 

(b) Show that U(—3) = {+1, (+1 + V—3)/2}. 

(c) Show that U(D) = {+1} for D < 0, D# —1,-3. 

(d) It is known that U(D) is infinite for D > 0. Find a unit other 
than =e] im UD) tor D = 2, 3,5,6;7. 

(e) Show that, for D > 0, every unit of U(D) other than +1 is 
an element of infinite order in U(D). 


Prove the following general test, which we can often use to show 
that O(VD) is not a UFD. 


Theorem. Let R = O(VD). If there is some prime p € Z such 
that 


(1) R does not have an element 8 with norm 6(3) = p, and 
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(2) R has an element a that is not divisible by p, but with norm 
d(a) divisible by p, 


then p is irreducible but not prime in R. Consequently, R is not 

a UFD. 

Use this test to show that O(VD) is not a UFD in the following 

cases: 

(a) D<0, D4 —2, D is even, p= 2. 

(b) D<0, D#-1, D=3 (mod 4), p = 2. 

(c) D<0, D#—-7, D=1 (mod 8), p= 2. 

(d) D <0, D=5 (mod 8), D is composite and p is the smallest 
prime factor of D. 

(e) D< 0, D=5 (mod 8), m = (1 — D)/4 is composite and p 
is the smallest prime factor of m. 

(f) D> 0, D is divisible by a prime g = 5 (mod 8), p = 2. 

(g) D > 0, D is divisible by a prime q, = 3 (mod 8) and by a 
prime q2 = 7 (mod 8), p = 2. 


O(VD) is a UFD (in fact, a PID) for D = —1, —2, —3, —7. Parts 

(a), (b), and (c) show that, except for these values, O(vD) 

is not a UFD for D < 0,D # 5 (mod 8). Parts (d) and 

(e) give partial information about the case D = 5 (mod 8). 

Gauss knew that O(VD) is a UFD (in fact, a PID) for D = 

1, —2, —3, —7, —11, —19, —43, —67, —163 and conjectured that 
these 9 values of D are the only negative values of D for which 
that is the case. That his conjecture is true is a deep and justly 
famous theorem of 20th century mathematics. 

Part (f) and (g) give very partial information about the case 

D > 0. Here Gauss conjectured that there are infinitely many 

positive values of D such that O(VD) is a UFD (in fact, a PID). 

This conjecture is still completely open. 

(While we have seen examples of UFDs that are not PIDs, we will 

see in Chapter 5 that rings of algebraic integers, which include 

O(VD), are UFDs if and only if they are PIDs.) 

(a) Let {fi(x), ..., fe(x)} be a set of primitive polynomials in 
Z|«]. If these polynomials have a common integer root r, then 
they are all divisible by the polynomial x — r and so are not 
relatively prime in Za]. Suppose these polynomials do not 
have a common integer root. Show that these polynomials 
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are relatively prime in Z[z] if and only if there is some posi- 
tive integer N such that gcd(fi(n), ..., fx(m)) divides N for 
every integer n. 
(b) If that is the case, let No(fi(x), ..., f,(a)) be the smallest 
such positive integer. Find No(a(x + 1), (a + 2)(a + 3)). 
Let p(x) € Q(z] be an arbitrary nonconstant polynomial. Show 
that there are infinitely many a € Q such that the polynomial 
q(x) = p(x) +a is irreducible in Q[z]. 
Prove Lemma 3.10.2, Lemma 3.10.4, Theorem 3.10.5, and 
Lemma 3.10.7. 
Let R be a commutative ring with 1. 


(a) Let M be a maximal ideal of R. Let a € R, a ¢ M. Show 
that for any b € R, the congruence ax = b (mod M) has a 
solution, and that that solution is unique (mod M). 

(b) Let P be a prime ideal of R. Leta € R,a ¢ P. Let bE R. 
Suppose that the congruence ax = b (mod P) has a solution. 
Show that that solution is unique (mod P). 

(c) Let P be a prime ideal of R that is not maximal. Show 
that there are elements a,b of R with a ¢ P, such that the 
congruence az = b (mod P) does not have a solution. 

(d) Give an example of (c). 


(a) Let R = Z[z]. For every n > 1, give an example of an ideal in 
R that is generated by n elements, but not by n—1 elements. 
Prove your example is correct. 

(b) Same for R = Q[z, y]. 

Show that every maximal ideal M in Z[z] is M =< p, f(x) > 

where p is a prime in Z and f(x) is a polynomial that is irre- 

ducible (mod p). 

Let R be a commutative ring with 1. 


Definition. R is a local ring if it has a unique maximal ideal MV. 


(a) Show that if M = {nonunits of R} is an ideal, then R is a 
local ring with maximal ideal M. 

(b) Suppose that R is an integral domain with quotient field 
F, and for every x € F, x #4 0 or x! CE R. (In this 
situation, R is called a valuation ring of F.) Show that 
M = {nonunits of R} is an ideal of R (and hence, by (a), 
that R is a local ring). 


260 


64. 


65. 


66. 


An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


Let F be a field. A discrete valuation v on F is a function v: F* > 

Z that is onto Z, with the properties that 

(1) v(xy) = v(x) + v(y) for all zy € F*, ie, uv is a homo- 
morphism from the multiplicative group of F to the additive 
group of Z; and 

(2) v(x +y) > min(v(x), v(y)) for all z,y € F* with r+ y 40. 

Let R= {x € F* | v(x) > 0} U {0}. 

(a) Show that R is a ring. R is called the valuation ring of v. 

) Show that R is a valuation ring of F. 

(c) Show that x € R is a unit of R if and only if v(x) = 0. 

) If x € F with v(x) = 0, show that x € R (and hence, by (c), 

that x is a unit of R). 

For each & > 1, let M, = {x € R| v(x) > k}. 

(e) Show that Mj is the unique maximal ideal of R, and hence 
that FR is a local ring. 

(f) Show that every nonzero proper ideal of R is M;, for some k. 
Also, show that M, = (M,)* for each k. 

(g) Show that Mj; is the only nonzero prime ideal of R. 

(h) Show that v is a Euclidean norm on R. Conclude that R is a 
PID. 

(i) More precisely, let « € R with v(x) = 1. Show that M;, is the 
ideal generated by x", for each k > 1. 

(j) Show that M, > Mz D> Mz > ..., and that Ne Mi = {0}. 


Let Ro be a PID with quotient field F. Let p be a prime in Ro. 
Define vp: F* — Z as follows: Let x € F, x 4 0, and write 
x = p"a/b with a and 6 relatively prime to p. Then vp(x) = n. 


(a) Show that v, is a discrete valuation on F, with valuation ring 
R={x €F | « =a/b with 6 relatively prime to p} U {0}. 

(b) Let F = Q. Show that every discrete valuation v on F is 
U = Up for some prime p in Z. 

Let R be a valuation ring of a discrete valuation v. We have seen 

that R is a local ring with maximal ideal M, that M is the only 

nonzero prime ideal of R, and that M > M? > M? > ... and 

gree eae (0) 

(a) Give an example of an integral domain R, not a field, that is 
a local ring with maximal ideal M where M = M? = M? = 
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(b) Give an example of an integral domain R that is a local ring 
with maximal ideal M where M > M? 5 M® > ... but 
nM! ¢ {0}. 

(c) Give an example of an integral domain R that is a local 
ring with maximal ideal M that has a nonzero prime ideal 
PSM. 

67. Let f(x) =an9+aiz+... and g(x) = b9 +bi2 + ... be formal 
power series with integer coefficients and suppose that ag and 
bo are relatively prime. Let t be a nonzero integer. Let h(a) = 
co +c," + ... be a formal power series with integer coefficients 
with cp = agbp and h(x) = f(x)g(x) (mod t). 

(a) Show that for every n # 0 there are formal power series 
fr(x) and g,(x) with integer coefficients, with fo(x) = f(x), 
go(x) = g(x), and for n > 0: 


fn(z) = fn-1(z) + dtx” for some integer d, 


Gn(Z) = Gn—1(x) + etx” for some integer e, 


so that fn(x) = f(x) (mod ¢) and gn (a) = g(a) (mod t), and 
hence fr()gn(x) = h(x) (mod t), and furthermore that 
n(x 


fn(@)g 


For example, let f(x) = 3+ 2, g(x) =5+4+ 4a. Let t= 11 and 
let h(x) = 15 — 32 + x”. Then: 


) = h(x) + terms of degree > n. 


(3+2)(5+2) = (15 -32+27)+11z 


(3 — 21x)(5 + 34x) = (15 — 34 + 2”) — 7152? 


(3 — 21x + 1432") (5 + 34x) = (15 — 32 + 2”) + 48622° 
(3 — 21a + 1432? + 8802z°)(5 + 34a — 15429) 
= (15 — 3a + #”) + 116627 + 2200222° — 1355202° 


(b) (i) Let f(z) =2+2, g(x) = 2-7. Let t = 5 and A(z) = 
x? — 14. Find f,p(x), gn(x) for n = 0,1, 2,3. 

(ii) Let f(x) = 7x + 2, g(x) = 22 +2243. Let t = 7 and 

h(x) = 927411246. Find f(x), gn(x) for n = 0,1, 2,3. 
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Note that in a natural sense we may form 


felt) Sim 35557 (2) ond Oa = lis 0) 


and then we have fw(z) = f(r) (modt), go(r) = 
g(x) (mod t), and foo(®)goo(%) = h(x). (Even if we begin 
with f(x) and g(x) polynomials, as in the above examples, 
foo(x) and goo(x) will in general be formal power series but 
not polynomials. ) 


Fix a prime p. Let S = {(a1,a2,a3, ...)} be the set of infinite 
sequences with a; € Z for every 7. Define addition and multiplica- 
tion on S coordinatewise, i.e., (a1, a2, a3, ...) + (b1, ba, b3, ...) = 
(ay + 61,42 + bo,a3 + b3,... and (a1, a2,03,.. .)(b1, bg, b3 .. rs = 
(a1b1, a2b2, a3b3, ...). Observe that S is a commutative ring 
with 1. 

Let T be the subring of S defined by 


T = {(a1, a2, a3, ...) | ai41 = a; (mod p') for each i > 1} 
Let I be the ideal of T defined by 
I = {(a1, a2, a3, ...) | a; =0 (mod p;) for each 7 > 1} 


(a) Show that J is a prime ideal of T that is not maximal. Con- 
clude that R = T/T is an integral domain that is not a field. 
We denote R by Zp. R is called the ring of p-adic integers. 
While elements of R are usually just written as sequences, 
we will put brackets around them to make clear that they are 
really equivalence classes of sequences. Note that any r © R 
has a unique representative of the form 


(a1,@2,a3,...) with 0<a;< p’ for each i > 1. 


(This may or may not be the best representative to use.) 
(b) Show that y: Z > R by y(n) = [(n, n,n, ...)] is a 1-1 homo- 
morphism. (Thus, by identifying n with y(n), we may regard 
Z as a subring of R.) 
Show that [(a1,a2,a3,...)] is a unit of R if and only if 
a, # 0 (mod p). (The preceding exercise may be helpful. 
Otherwise, you can do this “by hand”.) 


— 
io) 
—— 
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Let v: R— {0} > Z by v([(a1, a2, a3, ...)]) =i—1 if2 is the 
smallest value of k such that a, 4 0 (mod p*). Show that v 
is a Euclidean norm on R. Conclude that R is a PID. 

Show that R is a local ring with unique maximal ideal 


M = {[(a1, a2, az, ...)] | a1 = 0 (mod p)} = {nonunits of R}. 
Show that every ideal of R is 
M; = {[(a1, 42,43, ...)] | a~ = 0 (mod p*) for k < i} 


and that M; = M*. ; 

Let F be the quotient field of R. We denote F by Q,. F is 
called the field of p-adic numbers. Show that every element 
of R (resp. F) can be written uniquely as 


p*[(a1, 42,43, ...)] with a; 40 (mod p) 


for some nonnegative integer (resp. integer) k. 
Show that as a set, R (and hence F) is uncountable. 


Prove the following lemma. 


Lemma (Hensel’s Lemma). Fix a prime p. Let f(z) = 
u,c¢;2" be a polynomial with integer coefficients. Let a be an 
integer. 


(i) Suppose that there is an integer x; such that f(x) = 
a (mod p) and f’(2#1) = q #0 (mod p). Then for every 
n > 1 there exists an integer x,, unique (mod p”), such 
that f(a) =a (mod p”) and &p41 = Lp, (mod p”). 

(ii) More generally, suppose that there is an integer x,, such 
that f (2m) =a (mod p™) and f’(am) = p*q, q not divis- 
ible by p, with m > 2k+1. Then for every integer n > m 
there exists an integer x,, unique (mod p”), such that 
f(@n) =a (mod p”) and n41 =Zn (mod oo"), 


Conclude the following corollary. 


Corollary. In the situation of Hensel’s lemma, there exists a 
unique r € Zp, r = [(a1, a2, a3, ...)] with am = %m (mod p™) 


and f(r) =a. 
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70. Fix a prime p. 
(a) Let a be an integer relatively prime to p. 


(i) Let m be a positive integer relatively prime to p. Show 
that a has an m-th root in Zs if and only if the congru- 
ence 7’ = a (mod p) has a solution in Z. 

(ii) Show that a has a p-th root in Z, if and only if the 
congruence 2? = a (mod p*) has a solution in Z. 


(b) In general, let a be a nonzero integer and let m be a positive 
integer. Find and prove a necessary and sufficient condition 
for a to have an m-th root in Zp. 


Chapter 4 


Field Theory 


We now turn our attention to fields. Fields, of course, are particu- 
lar kinds of rings, but field theory has a completely different flavor 
than ring theory. 

In field theory, we are principally interested in investigating fields 
E that are “extensions” of a field F. Simply put, E is an extension 
of F if F is contained in E. As we will see, we will often begin with 
‘and a polynomial p(x) with coefficients in F, and obtain E by 
“adjoining”, i.e., adding in, a root, or roots, of p(x) to F. (Of course, 
we will have to make this precise). But from this short description 
you can already see that field theory is intimately related to questions 
about roots of polynomials — indeed, this was its historical origin. 

Our study of field theory will culminate in Galois theory, where, 
as we will also see, group theory plays a fundamental role in studying 
field extensions. 


4.1 Definition, examples, and basic properties 


We begin by recalling the definition of a field, which we restate 
here for convenience. 


Definition 4.1.1. A field F is a commutative ring with 1 such that 
F* = F—{0} (ie., such that every nonzero element of F is a unit). > 


Let us begin by looking at some familiar (and not so famil- 
iar) examples. 


265 


266 An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


Example 4.1.2. 


a) The rational numbers Q are a field. 

) The real numbers R are a field. 

) The complex numbers C are a field. 

) C can be described as C = R (i) as in Example 3.1.15. 
) 


3.1.16 is a field. 

(f) For p a prime, Z, = {0,1,...,p—1} with addition being addition 
(mod p) and multiplication being multiplication (mod p) is a 
field. Henceforth we will denote this field by F,. © 


Note that if 7 € F,, then k = j~' is the element of F, with 
jk = 1. How can we find j~!? Let us see. 


Example 4.1.3. 


(a) In Fy;, it is easy to find inverses by trial and error. Certainly 
1-1=1. Then 2-'=6 as 2:6 =12 = 1 (mod 11), and so 6-1! =2. 
Also, 3-1 = 4 as 3-4 = 12 = 1 (mod 11), and so 47+ = 3. 
Next, 5-1 = 9 as 5-9 = 45 = 1 (mod 11), and 9"! = 5. Next, 
7-1 = 8, as 7-8 = 56 = 1 (mod 11), and so 8"! = 7. Finally, 
10 = —1 (mod 11), and hence (10)? = (—1)? = 1 (mod 11), 
giving 10~'=10. 

(b) Let p = 618421, a prime. We wish to find 1248167! in F,. Here 
we resort to Euclid’s algorithm. We computed in Example 3.7.2 
(e) that 

1 = 618421(53773) + 124816(—266427) 
so 

1 = 124816(—266427) (mod 618421) 
so 

124816! = —266427 (mod 618421) 
and easy arithmetic shows 

—266427 = 351994 (mod 618421) 
so 
124816~* = 351994 


in Fp. © 
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Given a finite field, (i-e., a field with a finite number of elements) 
we can write out its addition and multiplication tables. 


Example 4.1.4. Addition and multiplication in F7 are given by: 


aow»Rwnr Oo 
GOMOOoRWN FIR 
rFOoOm oR W Yb 
NOr OM oe ww 
WNFrF OM OB, 
rPwnNnre O® oyu 
oR WNF OAS 


0 
1 
2 
3 
4 
5 
6 


aoowrwnNmro!: 
eo o oo ©: 'o |S 
Ow1rkrwnr oO]F 
CTWrR Ha FW Ob 
er TOM Dw Olw 
WON Or FO] 
OP DE W OO] 
PNW HR OD OD 


We now define an important invariant of fields. 


Definition 4.1.5. Let F be a field. The characteristic char(F) is the 
smallest positive integer n such that n-1=0 € F, or 0 if no such 
positive integer exists. © 


Lemma 4.1.6. Let F be a field. Then char(F) = 0 or is a prime. 


Proof. Suppose n = char(F) 4 0 and is not a prime. Certainly 
n#las14#O0inF. Write n = ab, 1<a,b<n.Thena-140 € F 
and b-140 € F, but (a-1) (6-1) = ab-1=n-1=0 in F, which is 
impossible as any field is an integral domain. 


Example 4.1.7. 


(a) Q,R,C and Q(VD) are all fields of characteristic 0. 
(b) F, is a field of characteristic p. ?) 
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Now suppose F and F’ are fields. Then they are both commutative 
rings with 1, and a homomorphism y: F — F’ of fields is simply a 
homomorphism rings with 1. But in the case of fields we have a strong 
property (Lemma 3.1.23), which we restate (and reprove) here. 


Lemma 4.1.8. Let F and F’ be fields and let yp: F > F’ be a homo- 
morphism of fields. Then vy is injective. Consequently, yp is an iso- 
morphism if and only if it is surjective. 


Proof. Suppose that a € F, a 40. Then a has an inverse a~', with 
aa—' = 1. But then 1 = ¢(1) = y(aa“") = y(a)y(a“") so y(a) 4 0. 
Consequently, Ker(y) = {0} and y is injective. 


Definition 4.1.9. Let F be a field. A homomorphism vy: F-+F is an 
endomorphism of F, and an isomorphism y: F->F is an automor- 
phism of F. v) 


Example 4.1.10. 


(a) The inclusions i: Q > R, i: R > C, and 7: Q > Q(VD) are all 
field homomorphisms. 

(b) Let F = C. Then vy: FF by y(at+bi) = a—bi is an automor- 
phism of F. (This is just complex conjugation.) 

(c) Let F = Q(VD). Then y: F>F by y(a + bVD) = a— bVD 
is an automorphism of F. (This y is called conjugation in the 
field F.) .) 


Remark 4.1.11. We leave it to the reader to check that it is only 
possible to have a homomorphism y: F — F’ of fields if F and F’ have 
the same characteristic. © 


Remark 4.1.12. 


(a) Let F be any field of characteristic 0. Then we have a homo- 
morphism of rings with 1 i: Z > F given by i(1) = 1, which 
then forces y(n)=y(n-1) = n-y(1) = n-1 = n, and then i 
extends to a unique homomorphism of fields 7: Q — F given by 
o(n/m) = p(nm-}) = o(n)o(m-)}=9(n)o(m)-? = n/m for 
any n/m € Q (ie., with m # 0). 

(b) Let F be any field of characteristic p. Then we have a homomor- 
phism of rings with 1 i: Z, - F, i.e., i(1) = 1, which then forces 
y(n) = n for any n € Fp. .) 
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Definition 4.1.13. We identify Fo = Q with a subset of any field of 
characteristic 0 as in Remark 4.1.12(a), and Fo = F, with a subset 
of any field of characteristic p as in Remark 4.1.12(b). In either case, 
Fo is called the prime field. © 


We now turn our attention to fields of positive characteristic. 
Here is a (surprising) fact about arithmetic in fields of 
characteristic p. 


Lemma 4.1.14. Let F be a field of characteristic p and let a, b € F. 
Then (a+ b)P = aP + BP. 


Proof. By the binomial theorem, 


(a+b)?=S° (?) Py 


i=0 


Now (5 )-(2)=1 and, since p is prime, 


0 
(?) il - i)! 
(p — 1) 


is divisible by p for every i with 1 < i < p — 1. 


Lemma 4.1.15. Let F be a finite field and let yp: F>F be an endo- 
morphism. Then y is an isomorphism. 


Proof. A 1—1 function from a finite set to itself must be onto. 


Lemma 4.1.16. Let F be a field of characteristic p. Then ®: F>F 
by ®(a) = a? is an endomorphism of F. 


Proof. We clearly have (1) = 1 and ®(ab) = (ab)? = a?b? = 


®(a)®(b). But also ®(a + b) = (a + Bb)? = a + BH = (a) + B(d) 
by Lemma 4.1.14. 


Definition 4.1.17. The map ®(a) = a@ of Lemma 4.1.16 is 
the Frobenius endomorphism (or automorphism, as the case may 
be) of F. v) 


Lemma 4.1.18. Let F be a field of characteristic p. 


(a) If F=F,, then the Frobenius map ®: FF is the identity. 
(b) IfF # F,, then the Frobenius map ®: FF is not the identity. 
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Proof. 


(a) By Fermat’s little theorem (Theorem 2.7.1) a?~' = 1 (mod p) 
for every integer a that is not divisible by p, so a? = a (mod 
p) for every such integer, and this is certainly also true if a is 
divisible by p. 

(b) Consider {a € F | ®(a) = a}. This is the set of roots of the 
polynomial f(z) = x?—< in F. By this is a polynomial of degree 
p, so can have at most p roots in F, by Corollary 3.3.9. By (a), 
every element of F,, is a root of this polynomial, so if a € F, 
a¢ Fy, O(a) # a. 


Remark 4.1.19. Admittedly, the only field of characteristic p we 
have seen so far is F,, itself. But we will be seeing other fields of 
characteristic p later (even soon), and we are preparing for them by 
introducing the Frobenius map now. o 


Remark 4.1.20. We will see that for every prime p and every pos- 
itive integer n there is a field Fp» of p” elements (as well as infi- 
nite fields of characteristic p), and indeed that F,» is unique up to 
isomorphism. We take this opportunity to caution the reader that 
while F, = Zp, Fpn A Zyn for n > 1. Note that Zp is not an integral 
domain for n > 1 (as for n > 1, p £0 in Zpn and p”~! £0 in Zpn, 
but p-p”~! = p" = 0 in Z,n) so cannot possibly be a field. © 


4.2 Extension fields 


Definition 4.2.1. Let E and F be fields with F C E. Then F isa 
subfield of E, or E is an extension field (or simply extension) of F. > 


Example 4.2.2. R is an extension of Q. C is an extension of R, and 


of Q. Q(VD) is an extension of Q. © 


We now present a general method of constructing field extensions. 
Actually, we already saw this method in Example 3.12.9, but we 
present it again here. 


Theorem 4.2.3. Let F be a field and let p(x) € F[a] be an irreducible 
polynomial. Then the quotient F[z|/(p (x)), where (p(ax)) denotes the 
ideal of F |x] generated by p(x), is a field. 
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Proof. Let R = Flz] and let J = (p(a)). Recall that R is a 
Euclidean ring with norm 6 (p(x)) = deg p(z). 

Since R is a Euclidean ring, it is a PID, and in a PID every 
irreducible element is prime (Lemma 3.6.15). Thus p(x) € F{a] is 
prime, and so J is a prime ideal (Lemma 3.12.5). Again, R is a PID 
so every nonzero prime ideal of R is maximal (Lemma 3.12.7). But 
then R/T is a field (Theorem 3.12.8). 


As an immediate consequence of this theorem, we have the fol- 
lowing general result. 


Theorem 4.2.4 (Kronecker). Let F be a field, and let p(x) € F[a] 
be any nonconstant polynomial. Then there is an extension field E of 
F in which p(x) has a root. 


Proof. First assume p() is irreducible. Let E = F[x]/(p(a)) as in 
Theorem 4.2.3. We may regard (and have regarded) F as contained 
in F[z] by identifying a € F with the constant polynomial a. Since 
no element of F is in the ideal generated by p(x) (since p(x), being 
nonconstant, has positive degree) this identification gives us a 1—1 
map from F into E, and we regard F as a subfield of E via this 
identification. In other words, E is an extension of F. Let 7: F[{a] > 
F[a]/(p (x)) = E be the canonical projection, and set a = m(z). Then 


p(a) =p(a(z)) =7 (p(w) =OmE 


(as p(x) € (p(x))). In other words, a € E is a root of the polynomial 
p(x), and so we see that E is an extension field of F in which p(x) 
has a root. 

If p(x) is not irreducible, factor p(a) into a product of irreducibles, 
say p(x) = pi(a)..- p(). 

Then, as above, E = F[z]/(p1(x)) is a field in which the polynomial 
pi(x) has a root. But if a@ is a root of pi(z), ie., pi(a) = 0, then 
certainly p(a) = 0, i-e., a is a root of p(x). 


Now let us look at a couple of examples. The first one shows us 
how to recover an old example from this new viewpoint. The second 
is a new example, which, moreover, concretely shows us how to do 
arithmetic in extension fields. 
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Example 4.2.5. Let F = Q and let D € Q, D not a perfect square. 
Then the polynomial p(x) = «2 — D € Fiz] is irreducible. (If it 
were not, it would have a root in Q, which would make D a perfect 
square.) Let E = Qs] / (p(x)), and let 7: Q [2] > E be the canonical 
projection. Let m(a#) =a. 

First we observe that from the division algorithm, 
{f (x) = ap + 412 | ao, a; € Q} is a complete set of coset represen- 
tatives of (p(x)) in Q|z]. Then m(aop +a1%) =ap + aya. 

Next, and key, we observe that, as in the proof of Theorem 4.2.4, 
we have a? — D = 1(2?—D) = x(p(x)) = 0€ Q[a]/(p(z)). In 
other words, a? = D in E. Thus we have constructed a field, more 
precisely, an extension field of Q, in which D has a square root. © 


Now you might object that we already know a field in which D 
has a square root, namely Q(VD), and you would be right in that 
the field we have constructed in “essentially the same as” Q(VD), 
though we have to be careful about what we mean by “essentially 
the same as”. But the point of this construction is that we may use 
it to obtain fields that are new. 


Example 4.2.6. Let F = Q and let p(x) = x3 + 7z + 1, an irre- 
ducible polynomial in Qa]. (If p(a) were not irreducible, it would 
have to have linear factor, i.e., a root in Q, and it does not.) 

Let E = Q[z]/(p(ax)) and let 7: Q[z] — E be the canonical 
projection. Set (x) = a and note that a? + 7a +1 = a(x? + Tx + 
1) =0. 

Since E is a field, a must have an inverse in E, and it does (and 
we can find it): 


a? +7a+1=0 
-—a® —7a=1 
a (—a? _ 7) =1 
so a7! = —a? — 7. 
Let’s look at some other elements. Let 6; = a + 3 and 62g = 


a? + 1. Then 8; + Bg = a® + a + 4. More interestingly, 8182 
(a + 3)(a? +1) =a + 8a? + @ + 3. Now (2 + 3)(2? +1) = 2? + 
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32? + 2 + 3 and we simply compute 


a + 3a? +043 = (x? +72 +1) (1) + (3a? — 6x + 2) 


sO 
a® + 307 +a4+3= (a®+7a +1) (1) + (3a? — 6a + 2) 
=0(1) +30? ba +2 
= 3a” — 6a +2 
and we see 


B Bo = 307 — 6a +2. 


Now { is a nonzero element of E (as x + 3 is not divisible by x? 
+ 7x + 1) so it must have an inverse. To find it, we perform Euclid’s 
algorithm on these two polynomials. This only takes one step: 


a? + Tz +1 = (x +3) (x? — 324+ 16) — 47 


so, solving, we find 


sO 
a’? — 3a + 16 
47 : 


Similarly, $2 is a nonzero element of E, and we find its inverse 
from Euclid’s algorithm, which now takes two steps: 


By* = 


a? +7r+1= (27 +1)2+ (62 +1) 


a 37 
2 — (Ge + a ane 
zg’ +1=(62+1) (5 (6x 1) + 35 
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sO 
7.2 36 —6z +1 
i=i¢ +1) ($) +6r+y (= ) 
2 36 3 
=a) () +c ae 1) (d) 
—6r+1 
+ (2? +1) (-2)| (=) 
—6r +1 6x? — x + 36 
= (a3 + 7 +1) (SP) ean oo 
and then 
_1 _ 60 a+ 36 
By =—37,. > v) 


Guided by these examples, we will see how to do arithmetic in 
extension fields. But first we need to introduce some general notions. 


Remark 4.2.7. Let E be an extension field of F. We will (usually) 
denote elements of F by Roman letters (a, b,c,...) and elements of 
EK by Greek letters (a,3,y,...) % 


Here is an easy lemma, but one which plays a crucial role. 


Lemma 4.2.8. Let E be an extension fields of F. Then E is an F- 
vector space. 


Proof. Since E is a field, we may add any two elements a, and ag 
of E to obtain the element a; + a2 of E. Also, since F C FE, for any 
element a of F and any element a of E, we may multiply a by a to 
obtain the element aa of E. We leave it to the reader to check that 
with these definitions of vector addition and scalar multiplication, E 
becomes a vector space over F. 


This leads us to the basic invariant of field extensions. 


Definition 4.2.9. Let E be an extension field of F. The degree of E 
over F, (E/F), is the dimension of E as an F-vector space, (E/F) = 
dimy E. (The degree (E/F) is often denoted by [E: F].) 0) 
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Let us now see how to determine (E/F). 


Lemma 4.2.10. Let p(x) € Fl] be a nonzero polynomial. The set 


S={r(xz)€F(x) | r(xz)=0 or degr(z) < deg p(x)} 


is a complete set of left coset representatives of the ideal I = (p(x)) 


of F\a]. 


Proof. Let f(z)€F|z]. Then we have, by the division algorithm, 
that 


f (x) =p(x)q(x)+r(x) withr (x) = Oor degr (x) < degp (x) 


so f(z) Er(a) +1. 

On the other hand, ifr) (x) + J = re (x) +J, then re (x) — 
I, ie., ro(%) — r1 (x) is divisible by p(x). Since ri(z) = 0. or deg 
ri(z) < deg p(x), and r(x) = 0 or deg ra(x) < deg p(x), we see 
that ro (x) — r1 (x) = 0 or deg ro(x) — ri(x) < deg p(x). But the 
polynomial p(x) cannot divide a polynomial of lower degree, so we 
must have rg (xz) — r1 (x) = 0, ie., ro (a) = 11 (a). 


ry (2) € 


Theorem 4.2.11. Let p(x) € Fla] be an irreducible polynomial of 
degree n, and let E = F[z|Xp(x)). Then E is an extension of F of 
degree n. 


Proof. Let 7: F[z] — E = F{z]/(p(x)) be the canonical projection. 
Then, since the set S of Lemma 4.2.10 is a complete set of coset 
representatives of (p(x)), 7: S — E is 1-1 and onto. It is also a 
linear transformation, as 7(r1(x) + ro(x)) = a(r1(x)) + m(r2(x)) and 
m(ar(x)) = an(r(x)), since (p(z)) is an ideal of F[z]. Thus it is an 
isomorphism of vector spaces, so dimp E = dimp S. But S has basis 
{1,2,...,2”—1}, so dimg S = n. 


Corollary 4.2.12. In the situation of Theorem 4.2.11, let a = 1(2). 
Then {1,a,...,a~+} is a basis for E as an F-vector space. 


Proof. Since 7 is a vector space isomorphism, the image of a basis 
under 7 is a basis. 


Corollary 4.2.13. Let F, be the field with p elements, p a prime, 
and let E be an extension of Fy of degree n. Then E has p” elements. 
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Proof. An n-dimensional vector space over F,, has p” elements. 


Remark 4.2.14. We showed in the proof of Theorem 4.2.11 that 
ma: S— E is an isomorphism of F-vector spaces. But we should point 
out that the structure of an F-vector space is all the structure S' has. 
Namely, we may add elements of S and multiply elements of S by 
elements of F, so we indeed have an F-vector space structure on S. 
But that is all. To be sure, we can multiply two elements of S (after 
all, S is a subset of F[z], and we may multiply any two polynomials 
in F[z]), but the result will not necessarily be an element of S. Thus, 
S is not closed under multiplication, so is not a field. © 


However, we do have the following explicit description of arith- 
metic operations in E. 


Corollary 4.2.15. In the situation of Theorem 4.2.11, the arith- 
metic operations on E are given as follows: Let 81, B2 € E. 


If 6; = 0, then 6, + 61 = Be and 6162 = 0, and similarly if 
By = 0. 

Suppose that 6, and {2 are both nonzero. Write 6, = fi(a) with 
deg fi(a) < n and 62 = fo(a) with deg fo(a) < n. Then: 


(1) Let g(x) = fi(z) + fo(z). Then 8; + 62 = g(a). 

(2) Let A(z) = fi(x)fo(z). Write A(x) = p(x)q(x)4+r(xz) with 
r(x) = 0 or deg r(x) < n. Then 6,82 = r(a). 

(3) Let s(z) and t(x) be polynomials with p(x)s(x) + fi(x)t(z) =1. 
a t(x) = p(x)u(x) + v(x) with deg v(x) < n. Then By! = 


Proof. Let I be the ideal J = (p(x)), and recall that 8, =fi(a) = 
fi(a(x)) = m(f1(x)) = fi(x) + I and similarly for fo(x). Then: 
(1) (f1(@) + DT) + (fa(x) + 2) = (file) + fo(z)) + 2 = g(a) +I. 
(2) (fil@) + D(fal@) + DT) = fila) fale) + 

= (p(x)q(x) + oe 
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filzji(z) e147 

filx)(p(w)u(x) + v(z)) E147 
fi(x)v(z) €1+7 

m(fi(z)o(z)) =1Ee E=Fl2}/T 


Piv(a) = lin E. 


With this concrete description of arithmetic in E in hand, let us 
give one more example. 


Example 4.2.16. We construct a field Fy with 4 = 2? elements, or 
equivalently, by Corollary 4.2.13, an extension of F2 degree 2. 

We look for a monic irreducible quadratic in Fs[z]. Let p(x) = 
x? + ax + b with a,b € Fo yet to be determined. Now a quadratic is 
irreducible if and only if it does not have a linear factor, so we want 
p(0) #~ Oin Fp and p(1) #4 O in Fy. Calculation shows there is 
exactly one such polynomial, p(x) = 27 + 2 + 1 € Fo[x]. Then Fy = 
Fo[z]/(p (2). 

We give the addition and multiplication tables for F4, which you 
may easily verify. 

Note that p(x) has the roots a and 1+ a in Fy, ie., p(x) = 
(s—a)(x—(1+q)) in F4[z]. 
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Also, the Frobenius map ©: Fy->F4, (3) = 87, (see Lemma 
4.1.16) is given by ®(0) = 0, (1) = 1, ®(a) = 1+a, ®(1+a) =a. 


We record the following result here for future use. 


Lemma 4.2.17. Let F be a field and let E be an extension field of F. 
Let f(x) and g(x) be polynomials in Fla] and let h(x) = gcd(f(z), 9(z)) 
in Fla]. Then f(x) and g(x) are polynomials in E[a], so have a ged 
h(x) = gcd (f (x), g(a)) in Ela]. Then h(x) = h(a). 


Proof. By the definition of a gcd, h (2) is divisible in E[x] by every 
polynomial in E[z] that is a common divisor of f(x) and g(x). Now 
h(a) is such a polynomial, so h(x) divides h (a) in E[z]. 

On the other hand, by Corollary 3.5.16, we may write 


h(x) = f (#) a (a) + g (a) b(a) 


for some polynomials a(x), b(z) in F[a]._ 
Now also by the definition of a ged, h (x) divides both f(«) and 
g(x) in E[z], so from this equation we see that h(x) divides h(x) in 
Ez]. - 
Hence, h(x) = h(a). 


4.3 Finite extensions 


We now wish to investigate finite extensions, which we first define. 


Definition 4.3.1. The extension field E of F is finite over F, or E is 
a finite extension of F, if (E/F)=dim, E is finite. .) 


We have already seen an example of this, which will turn out to 
be the prototype. 


Example 4.3.2. Let p(x) € F[z] be an irreducible polynomial of 
degree n. Then, by Theorem 4.2.11, E = F[z]/(p(a)) is extension of 
degree n, so in particular E is a finite extension of F. © 


Let us prove a couple of results which will be very useful for us 
in studying finite extensions. First we have a definition. 


Definition 4.3.3. Let F, B, and E be fields with F C B C E. Then 
B is intermediate between E and F. © 
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Lemma 4.3.4. Let E be an extension of F and let B be intermediate 
between E and F. Then E is a finite extension of F if and only if E 
is a finite extension of B and B is a finite extension of F. In this 


situation, (E/F) = (E/B)(B/F). 


Proof. First suppose that E is a finite extension of F. 

Then dim, E is finite. Now B C E so B is a subspace of E as an F- 
vector space, and so dimp B C dim r E. In particular, dim, B is finite. 

Let dimpE = n, so that E has a basis S = {aj,..., @n} as an 
F-vector space. Then S spans E, so every element of E can be written 
as an F-linear combination of elements of S. That is, if ¢ € E is any 
element, then ¢ = LiL, fia; for some fi,...,fn € F. But F C B so 
f; © B for each 7. Thus, this expression is an expression for E as a 
B-linear combination of elements of 5, and so we see that S spans E 
as a B-vector space. But then S has a subset T that is a basis fo 
E as a B-vector space, so dimg E is equal to the number of element 
in T, which is at most n, i.e., dimgE < n and in particular dimg 
is finite. 

Now suppose that E is a finite extension of B and that B is a 
finite extension of F. Let dimg E = m and dimg E = k. We will show 
the equality in the lemma, dimpE = mk, and then in particular 
dim g E is finite. 

Choose a basis {€1,...,€m} for E as a B-vector space and a basis 
{61,..., 8} for B as an F-vector space. Let 


k 


now 


= 


kb 


A= {Bee | t= 1a gt y = Leceg hf 


We will show that A is a basis for E as an F-vector space. In order 
to show this, we must show that A spans E and that A is linearly 
independent. 

First we show that A spans E. Let a € E. Then, since {€1,...,€m} 
spans E as a B-vector space, we can write 


m 
a= 8: with y%€ 
1=1 


Now {(1,...,8,} spans B as an F-vector space, so we can write 
each y; as yj = a Or with f,;CF. 
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Substituting, we see that 


m k 
a= », FijgP5 | & = S- Fig B5Ei 
i=1 \j=1 i 


and so A spans E as an F-vector space. 
Next we show that A is linearly independent over F. So suppose 


S  faeeea 0 
tJ 


We wish to show each fj; = 0. To do so, we regroup terms. 


m k m 
0= >». fig Bei = ». fig8j | i = S > bei 
iJ i=1 \j=1 i=1 
k 
with 0; = Sig B;- 
j=l 
Now each 6; is in B, and {¢1,...,€m} is a set of linearly indepen- 


dent elements of E over B, so 6; = 0 for each i. 
Thus for each 3, 


k 
(= 7. 
j=l 


But {61,...,6,} is a set of linearly independent elements of B 
over F, so for each i, every fj; = 0. Thus fj; = 0 for all 7, 7, and A is 
linearly independent over F. 

Finally, A has mk elements, so dimp E = mk, as claimed. 


Corollary 4.3.5. Let E be a finite extension of F and let B be inter- 
mediate between E and F. Then: 


(1) B=E @f and only if (B/F) = (E/F). 
(2) B= F éf and only if (B/F) = 1, or, equivalently, if and only if 
(E/B) = (E/F). 
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Proof. Recall that if V is a finite dimensional vector space (over 
any field) and W is a subspace of V, then W=V if and only if dim 
W=dim V. 
Then (1) follows, setting W = B and V = E. 
Also, the first claim in (2) follows, setting W =F and V =B. 
As for the second claim, since (E/F) = (E/B)(B/F) by Lemma 
4.3.4, we see that (E/B)=(E/F) if and only if (B/F) = 1. 


Corollary 4.3.6. Let E be a finite extension of F and let B be inter- 
mediate between E and F. Then (B/F) divides (E/F). 


Proof. This is immediate from the equation (E/F) = (E/B) 
(B/F). 


Remark 4.3.7. With the convention that n-oo = oo-n = oo for 
any positive integer n, and co- 00 = oo, then (E/F) = (E/B)(B/F) in 
general, i.e., Lemma 4.3.4 holds in general. But we need (E/F) finite 
in order for Corollary 4.3.5 to be valid and in order for Corollary 
4.3.6 to give any information. © 


We now want to investigate the notion of composition of field 
extensions. To prepare for doing so, we prove the next lemma, which 
is interesting and useful in its own right. 


Lemma 4.3.8. Let R be an integral domain that is also a finite 
dimensional vector space over a field F. Then R is a field. 


Proof. We need to show that every nonzero element of R is invert- 
ible. So let reR, r # 0. Consider the set {1,r,r?,... }. This is an 
infinite set of elements is a finite dimensional vector space, so is lin- 
early dependent. Consider any nontrivial linear dependence relation 
agr® + (ear +++++amr™ =0, with ag ~ 0. Then 


0 =azr* + ayqar®tt tid gee” =per® (1+ bir +---+bdyr”) 


where b; = ag4; and n = m—k. Since R is an integral domain. The 
first factor on the right hand side is nonzero, and then the second 
factor must be 0. This gives 


L=—byr — +++ = bye” = 7 (by +++ — bar”) 


SO 


rob = by) — ++ bar TER. 
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Definition 4.3.9. Let E be an extension of F and let B, and By be 
intermediate fields between E and F. Then B, and Bg are disjoint 


extensions of F if B} Nn By =F. © 
Definition 4.3.10. Let E be an extension of F and let B,; and Bo 
be intermediate fields between E and F. Their composite D = B,Bo 
is the smallest subfield of E containing B, and Bo. © 


Observe that there is some subfield of E, namely E itself, that 
contains both B, and Bo, and hence there is a smallest such subfield, 
namely the intersection of all such subfields. 

This is an abstract definition. Let us see how to make it concrete. 


Lemma 4.3.11. Let E be an extension of F and let By and Bo be 
intermediate fields between E and F, both of which are finite exten- 
sions of F. Then their composite D = B, Bg is give by 


= {Do dibs | bi © Bib) € Ba}. 


Proof. Call the right hand side R. Clearly any field that con- 
tains both B, and Bz must contain R. So we need only show that 
R is a field. 

Now R contains 1 and is clearly closed under addition and mul- 
tiplication. Also, R has no zero divisors as R C E and E has no zero 
divisors (as E is a field). Thus R is an integral domain. 

R is also clearly an F-vector space, and if B, has basis {(1; |i = 
1,...,7} and Bg has basis {G,|k = 1,...,1} (as F-vector spaces) 
then R is spanned by the finite set {01;,G9, |i =1,...,7, k=1,...,0} 
so is a finite dimensional F-vector space. Thus, by Lemma 4.3.8, 
R is a field. 


Corollary 4.3.12. Let E be an extension of F and let By, and Bg be 
intermediate fields between E and F, both of which are finite exten- 
sions of F. Then: 


(a) (i) (Bi B2/F) < (Bi /F)(B2/F) 
(ii) (Bi B2/Bi) < (B2/F) 
(iii) (B1B2/Be) < (B,/F) 
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(b) Jf By and Bz are disjoint extensions of F, the following are 
equivalent: 


(i) (Bi B2/F) = (B,/F)(B2/F) 
(ii) (Bi B2/Bi) = (B2/F) 
(iii) (B1B2/Bz) = (B,/F) 


(c) If B, and Bz are not disjoint extensions of F, none of the equal- 
ities in (b) are true. 


Proof. (a) (i) follows immediately from the proof of Lemma 4.3.11. 
We know from Lemma 4.3.4 that 


(Bi B/F) = (B,B2/B,) (Bi /F) = (Bi B2/B2) (B2/F) 


from which the rest of (a) and (b) follows. 
(c) We have that 


(B,Bo/F) = (B,B2/B, Bg) (Bi NB2/F) 
(B2/B , N Bo) (By N B2/F) 
(B2/B, N Bo) 

1 B2/F)? / (BN B2/F) 

(B, M Bo/F) (Bo/B, N Bo) 

(B, 1 B2/F) 

= (B,/F) (B2/F) / (Bi 1 B2/F). 


If B, and Bg are not disjoint extensions of F, i.e., if F C By N Bo, 
then (B; 0 B2/F) > 1 and we see that (B,B2/F) < (B,/F)(B2/F). 


Corollary 4.3.13. Let E be an extension of F and let B, and Bo be 
intermediate fields between E and F, each of which is a finite exten- 
sion of F. If the degrees (B,/F) and (B2/F) are relatively prime, then 


(a) B, and Bg are disjoint extension of F; and 


(b) (B,B2/F) = (B,/F)(B2/F). 


Proof. By Corollary 4.3.6, (B; MN Bg/F) must divide both (B,/F) 
and (B2/F), so (By N B2/F) = 1 and then B; 9 Bz = F by Corol- 
lary 4.3.5. 
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Also, (B,Bo/F) < (B,/F)(Bo/F) by Lemma 4.3.12 and is divisi- 
ble by both (B,/F) and (B2/F) by Corollary 4.3.6, so (B,B2/F) = 
(B, /F)(B2/F). 


4.4 Algebraic elements 


Definition 4.4.1. Let E be an extension of F and let a € E. Then 
a is algebraic over F if f(a) = 0 for some nonzero polynomial f(x) € 

An extension E of F is algebraic if every element of E is algebraic 
over F. © 


In other words, a is algebraic over F (which we will abbreviate to 
algebraic, if F is understood) if it is a root of some nonzero polynomial 
f(x) with coefficients in F 


Example 4.4.2. 


TW 


(a) Every element a of F is algebraic over F as it is a root of the 
polynomial x — a in F{z]. 

(b) V2 and—v2 are pov algebraic over Q as they are both roots of 
the polynomial «? — 2 in Q[a]. Indeed, every element of Q[V2] 
is algebraic over Q. To see this, let a € Q [V2], soa=a+bV/2 
with a,b € Q. If b = 0, then a = a € Q is algebraic over Q by 
part (a). Suppose not. Then a — a = bV2, so (a — a)? = 207, 
a? —2aa+a? = 2b?, and then a? — 2aa+ (a? — 2b?) = 0, soaisa 
root of the polynomial f(x) = x? —2axr+(a?— 2b?) in Q[z]. (This 
polynomial is irreducible and its other root is @ = a — by/2.) 

Let p(x) € F{a] be an irreducible polynomial and let E = 
F{a]/(p(x)). Let 2: F[2] + E be the projection and let a = 7(z). 
Then, as we have seen, p(a@) = 0, and so we conclude that a is 
algebraic over F. © 


— 
le) 
WN 


Comparing (c) with (b), we might hope that not only a, but in 
fact, every element of E, is algebraic over F. Indeed, not only is this 
true, but a more general result is true. 


Theorem 4.4.3. Let E be a finite extension of F. Then E is an 
algebraic extension of F. 


Proof. Let (E/F) =n, so that E is an n-dimensional vector space 
over F. Let a be any element of E, and consider the set {1, a,...,a”}. 
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This is a set of n+ 1 elements in E, so must be linearly dependent. 
Thus there are scalars (i.e., elements of F) ao, a1,...,@n, not all zero, 
with ag-l+a,-a+-:-+a,a” = 0. Then a is a root of the nonzero 
polynomial f(x) = anz" +--+: +a12 4+ ao in Fla]. 


Lemma 4.4.4. Let E be an extension of F and let a € E be algebraic 
over F. Let f(x) € Fla]. The following are equivalent: 


(1) f( 
{9 
(2) f( 
f( 
(3) f( 
f(a 


Proof. First recall that Fla] is a PID so every ideal is principal. 
The ideal J is nonzero precisely because a is algebraic over F. Thus 
I has a unique monic generator f(z). 

If g(x) is any polynomial with g(a) = 0, then g(x) € I, so f(x) 
divides g(x). Assume g(x) is monic. Then githier g(x) = f(x) or deg 
g(a) > deg f(a), so (1)<>(2). 

Let g(x) be any monic polynomial with g(a) = 0. Then g(x) 
factors uniquely into a product of monic irreducible polynomials 
g(r) = filz)...fe(z). Then 0 = g(a) = fila),-.-, fala), 50 
fi(a) = 0 for at least one value of i, i.e., g(x) has a monic irreducible 
factor f(x) = fi(x) with f(a) = 0, so (1)S(38). 


x) is the unique monic generator of the ideal I = 
(x) € F(z] | g(a) = 0} of Fle}. 

x) ws re unique monic polynomial in F(x] of lowest degree with 
a 


x) ws th unique monic irreducible polynomial in Flax] with 


)a 
be 
) 
= 


Definition 4.4.5. The unique monic polynomial satisfying the 
equivalent conditions of the above lemma is called the minimal poly- 
nomial of a, denoted mq(x). 0) 


(This language comes from property (2), which says that it is of 
minimal degree.) 


Example 4.4.6. 


(a) Ma(x) = x — a is linear if and only if a € F. 

(b) If F= Q,m 5 (2) = 2? -2€ Q[z]. 

(c) In the situation of Example 4.4.2(c), ma(x) = p(x). .) 
Remark 4.4.7. The polynomial m,(x) depends on F (e.g., if F = Q 
then m /3(x) = x? — 2 while if F = Qv2 then M g(x) = 2 V2) but 
we suppress that dependence from the notation. © 
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Definition 4.4.8. Let E be an extension of F and let a € E be 
algebraic over F. The degree of a (over F) is the degree of its minimal 
polynomial mq(x) € F[z]. o) 


Lemma 4.4.9. Let E be an extension of F and let be algebraic over 
F. Let F(a) be the subfield of E 


F [a] = {f(@) | f(e) € Fia}f. 


Then Fla] is an extension of F of degree equal to the degree of a. 
Also, if E is a finite extension of F, then the degree of a divides 


(E/F). 


Proof. We have a homomorphism y: F|z] — F|a] defined by 
y(f(x)) = f(a). This homomorphism is onto and its kernel is 
I = (m(x)). Thus F[z]/(ma(x)) is isomorphic to F(a). Now 
F[z]/(ma(x)) is an extension of F of degree mq(x), which by def- 
inition is the degree of a, by Theorem 4.2.11. Since F C F(a) C E, 
the second conclusion follows directly from Corollary 4.3.6. 


Now let us turn to a concrete question. Let p(x) € F[z] be an 
irreducible polynomial and let E = F{a]/(p(a)). Given an arbitrary 
element 6 of E, how can we find mg(z)? The answer is simply trial 
and error, guided by our proof of the theorem that 6 must be alge- 
braic over F (Theorem 4.4.3). 

As we have seen, (E/F) = n where deg p(x) = n, and E has basis 
{1,a,...,a”~'}, where a = r(x). Let 8 € E, so that we can express 
B = qi(2) for qi(x) a polynomial in F[z] of degree at most n—1. Test 
if {1, qi(a)} is linearly dependent. (Of course, we know the answer 
here: This will be the case if and only if a € F.) If so, a nontrivial 
linear dependence relation will give us the coefficients of mg(z). 

If not, express 8? = qo(x) with qo(x) a polynomial in F[z] of 
degree at most n — 1. If n is not divisible by 2, 8 cannot have degree 
2, by Lemma 4.4.9. If n is divisible by 2, test if {1, qi(a), q2(a)} is 
linearly dependent. If so, a nontrivial linear dependence relation will 
give us the coefficient of mg(x). 

If not, keep going. We must stop no later than {1, 
qi(@),---,@n(a)} so this procedure will eventually come to an end. 


Example 4.4.10. Let F = Q and let p(x) = 2? + 7x +1, an irre- 
ducible polynomial in Q [z]. Let E = Q[z]/(p(x)) and let 7: Q|a] - 
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E. Set 7 (x) = a. We note that mg(x) = 2° + 7x + 1. Thus a has 
degree 3 (over Q). 


(a) Let 6; = a + 3. We wish to find mg, (x). In this case we begin 

with an observation that will save us some effort. Of course, 

F(G,] C E = F{a]. But notice that a = 6,-3 soE=F [a] C 

F [G1]. Hence, F [G,] = F [a] and hence (1 has degree 3 (over Q) 

as well. Now a is a root of mag(2), i-e., 61-3 is a root of ma(z), 

so (G1 — 3)? +7 (6; —3) +1=0. In other words, is a root of 

the polynomial (2 — 3)? +7 («a — 3)+1 = 23 —9x? +34a—47 and 

since this is a monic polynomial of degree 3, it must be mg, (x). 
Thus, we conclude that mg, (x) = 23 — 9x? + 34x — 47. 

(b) Let Bo = a? + 1. Since Bo ¢ Q, we cannot have deg(82) = 1 

Then, since deg(32) must divide 3, we must have deg((2) = 3. 

+ 


We compute Be = (a? + 1)? =o°4207+1= (a? + 7a+ 1) a 
(—5a? —at 1) so 62 = —5a* — a+ 1. We compute 63 
Be px= (—5a? — a+ 1) (a? +1) = foo de =@q@4l= 
(a? + 7a + 1) (—Ba — 1) + (31a? + 1la+2) so Bf = 31a? + 
lla + 2. Thus, 


{1, B2, 63, b3} = {1, a? +1, —5a? —a +1, 31a? + 1la+2}. 
Now if mg, (x) = 2? + Ax? + Br +C, mg, (61) = 0 is the equation 
(31a? + lla + 2) + A(—5a? —a+1)+B(a?4+1)+C=0. 


Equating the coefficients of the powers of a, this gives the linear 
system 


31-5A+B=0 

11-A=0 

2+A+B+C=0 
with solution A = 11, B = 24, C =-37. Thus, mg,(r) = x? + 1127+ 
24x — 37. v) 


Example 4.4.11. Let F = Q and let p(x) = 24 — 102? + 1. We 
first see that this is an irreducible polynomial in Q{z], as follows: 
By the rational root rest, Lemma 3.11.12, we see that the only 
possible rational roots of p(x) are « = +1. Since p(1) # O and 
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p(—1) # 0, neither of these is a root of p(x). Thus the only possi- 
ble nontrivial factorization of p(x) in Q|z] is as a product of two 
quadratics. Now p(x) is a monic polynomial with integer coeffi- 
cients, so it follows from Gauss’s Lemma (specifically, an application 
of Corollary 3.11.6) that in this case p(x) must have a factoriza- 
tion as a product of two monic quadratics with integer coefficients, 
pts) = (a + a,x + ao) (ze + byx + bo) with ag, a1, bo, 01 € Z. 

Now aobo = 1 so we must have either ag = bp = 1 or ag = bo = 
—1. In either case, the fact that the “x” coefficient of p(x) is 0 forces 


b; = —a,. But then examining the “zr?” coefficient of p(x) gives the 
equation 2 — af = —10 in case ag = bg = 1 and —2 — a? = —10 in 
case ag = bo = —1, but in neither case does this have solution with 


ag € Z. Thus, p(x) is irreducible. 
Let E = Q[z] / (p(x)), 7: Q [2]  E, and a(x) = a. Then a has 
degree 4 with ma(x) = x* — 10x? +1. 


(a) Let 6, = a® — 9a 4+ 2. Since 6, ¢ Q, we cannot have deg(1) 
= 1. We compute 6? = a® — 180% + 403 + 81a? — 36a +4 = 
(a4 — 1007 + 1) (a? = 8) + (4a? — 36a + 12) so 6? = 4a7—36a+ 
12. Thus, 

{1, 1, Bf} = {1, 0% — 9a + 2, 4a? — 36a + 12}. 
Now if mg,(x) = x? + Ax + B, mg, (81) = 0 is the equation 
(40% — 36a + 12) + A (a? —9a+2)+B=0. 


Equating the coefficients of the powers of a, this gives the linear 


system 
4+A=0 
0=0 
—36 —9A=0 


124+2A+B=0 


with solution A = —4, B = —4. Thus mg, (xz) = 2? — 4x — 4. 
(b) Let By = a? + 2a —3. Since Bo ¢ Q, we cannot have deg (82) = 1. 

We compute 63 = a4+4a°—2a?—-12a+9 = (a4 — 10a? + 1) (1)+ 

(4a? + 8a? — 12a + 8) so 63 = 4a? + 8a? — 12a +8. Then 
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{1, Bo, 83} = {1,07 + 2a — 3, 40% + 8a? — 12a + 8}. 
Now if mg,(x) = x? + Ax + B, mg,(f2) = 0 is the equation 
(403 + 8a? — 120 +8) + A(a? + 2a — 3) + B(1) =0. 


Equating the coefficients of the powers of a, this gives the system 


4=0 
8+A=0 
—12+2A=0 
8-3A+B=0 


which has no solution. Thus we conclude 62 does not have 
degree 2. 

We proceed further. We next compute 83 = 8382 = (4a? + 
8a? —12a4+8) (a2+2a—3) = 40°+16a4—8a3—40a?+52a—24 = 
(a* — 10a? + 1)(4a + 16) + (320° + 1200? + 48a — 40) so 63 = 
32a + 120a? + 48a — 40. We then compute 83 = 6382 = (32a? + 
1200? + 48a — 40)(a? + 2a—3) = 32a° + 18404 + 19203 — 3040? — 
2240+ 120 = (a4 — 10a? + 1)(32a +184) + (—52803 — 15360?+ 
256a + 64) so 64 = —528a3 — 15360? + 256a + 64. Thus 


{1, 8, 87, 6°} = {1,07 + 2a — 3, 4a® + 8a? — 12a + 8, 
32a° + 12007 + 48a — 40, —528a? 
—1536a7 + 256a + 64}. 


Now if mg,(x) = 24+ Av? + Bz? + Cz + D, mag, (62) = 0 is the 
equation 


(—528a° — 15360? + 256a + 64) + A(32a3 + 1200? + 48a — 40) 
+B(403 + 8a? — 120 + 8) + C(a? + 2a —3)+D=0° 


Equating the coefficient of the powers of a, this gives the system 
—528+ 3244+4B=0 
—1536 + 1200A+8B+C=0 
256 + 48A —12B+2C =0 
64 —- 404A +8B-3C+D=0 


290 An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


with solution A =-—8, B= -—96, C = 448, D = 624. Thus 
mg,(x) = x* — 843 — 96x? + 448x + 624. > 


Example 4.4.12. Let F = Fy and let p(x) = 2° + 2 + 6, an irre- 
ducible polynomial in F7[z]. (We easily see that p(x) is irreducible, 
as it is a cubic which does not have a root in Fv.) 

Let E = F7[2]/ (p(x)), p: F7|z] > E and a = a(z). 

Let 8 = a?+1. Since 8 ¢ Fy, it must have degree 3. 

We compute 6? = a4 +207 +1= (aF +a+6)at (a? +a+1) 
so 6? = a?+a+1. We compute 6? = 678 = (a? +a+ 1) (a+1)= 
at + a3 +2027 +a4+1 = (a? +a+6) (at1) + (a? +a+4+5) so 
B=a?+a+5. Then 


116 ae} = {1, a? +107 ++ 1,07 +at+5}. 
Now if mg(x) = 2° + Ax? + Br +C, mg(8) = 0 is the equation 

(7? +a+5)+A(e?+a4+1)+B(a?+1)+C=0 
which gives the system 


1-A+B=0 
1+A =0 
5+A+B+C=0 


with solution A = 6, B = 0, C = 3. Thus mg (2) = x? + 62? + 3.6 


Remark 4.4.13. Let us step back from these particular computa- 
tions and think about what we have been doing. We have a field 
E that is an extension of F of some finite degree n. In our case 
K = F[a]/(p(x)) for some irreducible polynomial p(x) in F[a], so we 
had an explicit basis {1,a,...,a”~1!} of E. We considered an ele- 
ment 8 = bn_ja"~!+---+b,a+ bp of E, and asked about its degree. 
Suppose that 6 has degree k, or equivalently that its minimum poly- 
nomial mg(x) has degree k, mg(x) = 2* + Ap_ya*-1 4 +--+ Ap. 
Then 


B=({1,8,...,B*} 


is a linearly dependent set of (k + 1) elements in an n-dimensional 
vector space, and when try to find the coefficients of mg(x), equating 
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the coefficients of the n powers of a (from 1 to a”~!) give us a non- 
homogenous system of n equations in the & unknowns Ao,...,Az—1. 

Now let us think about this situation from a linear algebra per- 
spective. If k < n, then B is a set of at most k elements in an 
n-dimensional vector space, and we expect such a set to be linearly 
independent. Alternatively, if k < n then we expect that a non- 
homogenous system of n equations in k unknowns will not have a 
solution. Thus, either way we would expect that if we pick an ele- 
ment “at random”, we will not have k <n, and so k = n and mg(z) 
has degree n. But also in this case we have F C F([S] C E with 
(F[6]/F) = (E/F) = n, and then E = F[(] by Corollary 4.3.5. 

Now of course in our situation B is not an arbitrary set of ele- 
ments of E, but rather consists of powers of a single element 5, and in 
our situation the coefficients of the powers of a in our linear system 
are not arbitrary but rather are expressions (as it happens, compli- 
cated polynomial expressions) in the coefficients b,1,...,69. So it 
is possible that there is something mysterious going on that always 
confounds our expectations, but that seems highly unlikely. 

In fact, our linear algebra institution is just about right. We will 
see later on that in great generality (thought not quite always) that if 
EK is a finite extension of F and we pick an element of E “at random”, 
then E = F{§]. (The fact that at least one such 6 (almost always) 
exists is called the theorem of the primitive element, Theorem 4.12.2.) 
Indeed, our proof of this theorem will be a linear algebra proof (one 
involving theoretical linear algebra, not writing down systems of lin- 
ear equations), showing how perspicacious this view point is. © 


We conclude this section by recording a simple but very use- 
ful general result. 


Lemma 4.4.14. Let f(x) € Flix] be an irreducible polynomial of 
degree n. Let B be an extension of Fx] of degree m. If m and n are 
relatively prime, then f(x) is irreducible in B[x]. 


Proof. Since f(x) is irreducible in F[z], we know that (F(a)/F) = 
n, where F(a) is the field obtained from F by adjoining a root a 
of F(x). Now (B/F) = m, and m and n are relatively prime, so 
by Corollary 4.3.13 (B/F(a)/F) = mn, ie, (B(a)/F) = mn. But 
B(a)/F) = (B(a)/B)(B/F) so (B(a)/B) = n. But that means that 
a has degree n over B, and so f(x) is irreducible in B[z]. 


— 


292 An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 
4.5 Adjoining elements 


One of the most common, most important, and most useful ways of 
obtaining field extensions is by adjoining elements. Actually, we have 
already used this procedure, without giving it a name. But now we 
will study it more intensively and more systematically. 

Before stating the basic definition, we recall that a rational func- 
tion r(x) is a quotient of polynomials, r(x) = p(x)/q(x) with p(x) 
and q(x) polynomials. 


Definition 4.5.1. Let F be a subfield of E, and let a@ be an element 
of F. The field 


F(a) = {r(x) | r(x) = p(x)/q(x) is a rational function 
with coefficients in F, and g(a) 4 0} 


is the field obtained from F by adjoining a. © 


It is easy to check that F(a) is indeed a field. Also, it the smallest 
field extension of F that contains a@ in the sense that any extension 
F containing a must contain F(a). 

Observe that if a € F then F(a) = F. Thus, adjoining an element 
of F to F gives us nothing new. But if a ¢ F, then F(a) # F, so if 
a ¢ F we do indeed get a new field. 

We remarked that we have already seen and _ used this 
construction—compare Lemma 4.4.9. But in that lemma we only 
needed to use polynomials, while in Definition 4.5.1 we needed to 
use rational functions. What’s the difference? It is because in Lemma 
4.4.9 we were requiring that the element a was algebraic. We now 
recall why that is true (from a slightly different viewpoint, though 
one which, if you trace it back far enough, turns out to be the same) 
and show that the converse is true as well. 


Lemma 4.5.2. Let F be a subfield of E and let a be an element of 
E. Then F(a) = {p(x) | p(x) is a polynomial with coefficients in F} 
if and only if a is algebraic over F. 


Proof. Both conditions are certainly true if a = 0, so we may 
restrict our attention to the casea ¥ 0. 

First suppose that a is algebraic over F, and consider 3 = r(a) = 
p(a)/q(a). (Note g(a) 4 0.) Since a is algebraic, it has a minimal 
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polynomial m,(x). Note that mq(«) is irreducible (by Lemma 4.4.4) 
and hence prime in F{z]. Since g(a) 4 0, ma(x) does not divide 
q(x), and hence mq(x) and q(x) are relatively prime. But F[z] is a 
Euclidean domain, so there are polynomials s(x) and t(x) in Fiz] 
with 


1 = mo(x)s(x) + q(#)t(2). 


But then 1 = ma(a)s(a) + q(a)t(a). Of course, Ma(a) = 0, so 1 = 
q(a)t(a) and t(a) = 1/9q(a). Thus 8 = r(a) = p(a)/q(@) = p(a)t(a) 
can be expressed as a polynomial in a. 

Conversely, suppose that every rational function in a@ can be 
expressed as a polynomial in a. In particular, then, 1/a@ = g(a) for 
some polynomial g(a) with coefficients in F. But then 1 = ag(a) so 
ag(a) — 1 = 0 and ais a root of the polynomial f(x) = xg(x) —1 € 
F[a] and so a is algebraic over F. 


Now let us adjoin more than one element. The basic definition is 
essentially the same. 


Definition 4.5.3. Let F be a subfield of E, and let {a1,a2,...} be 
a subset of E. The field 


F ais G9, cnc} ) = 4 Oar) | Pi Passe) 
= (2,08, .0-)/ OBis Ways xs) 
is a rational function of 71, 22,... with 


coefficients in F, and q(a1, a2,...) # 0} 
is the field obtained from F by adjoining (a1, Q2,...). » 


For convenience, we state the following lemma for two elements, 
though it holds for any finite number of elements. 


Lemma 4.5.4. Let F be a subfield of E and let a1,ag € E. Then 


(a1, 2) = (F(a1)) (a2) = (F(a@2)) (a1). 


Furthermore, F(a1, a2) is the composite F(a,)F(a2). 


Proof. F(a,,q@z2) is the set of rational functions in a, and a2 with 
coefficients in F. By gathering terms together, we may regard any 


294 An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


rational function in a; and a2 with coefficients in F as a rational 
function in a2 with coefficients in F(a1), or as a rational function in 
a, with coefficients in F(a2), and vice-versa. 

Furthermore, it is clear that F(a, a2) is the smallest subfield of 
E containing both F(a,) and F(ag), so F(a,, a2) = F(a1)F(ag2). 


Remark 4.5.5. The way we view this lemma is it says that if 
{a1,...,Q@n} is a finite set of elements of E, we may obtain the field 
F(ay,...,Q@n,) by adjoining all of these elements at once, or by adjoin- 
ing them one at a time, and in the latter case, the order in which we 
adjoin them does not matter. © 


Theorem 4.5.6. Let {a1,...,Qn} be a finite set of elements of E, 
with each a; algebraic over F. Then F(aj,...,Qn) is a finite, and 
hence algebraic, extension of F. More precisely, if a; has degree dj, 
t= 1)55+5%, then (F(O4,...;0,)/F) < dyed. 

Conversely, every finite extension E of F is obtained by adjoining 
finitely many algebraic elements, t.e., E = F(a,,...,@n) for some 
finite set {a1,...,Qn} of algebraic elements of E. 


Proof. We proceed by induction on n. We have already seen the 
case n = 1: F(a) is an extension of F of degree d; (Lemma 4.4.9). 
Now suppose the theorem is true for any set of n—1 elements, and 


consider a set of n elements {aj,...,Q@,}. Let B = F(ay,...,@n—-1) 
so that F CBC F(ay,..., an). 
As we observed in Lemma 4.5.4, F(ai,...,@n) = (F(ai,..., 


Qn—1)) (An) = B(a,). Then by Lemma 4.3.4 we know that 
(F(a1,..-,@n)/F) = (Blan)/F) = (B(@n)/B)(B/F). 


By the inductive hypothesis we know that B is a finite extension 
of F of degree D < dj ---dn_1, so to prove the theorem we need 
only show that B(a,) is a finite extension of B of degree d’ < dy. 

But, by Lemma 4.3.4, B(a,) is the composite BF(a,,), and then, 
by Corollary 4.3.12, 


d' = (B(Qn)/B) = (BF(a)/B) < (F(Qn)/F) = dn, 


and we are done by induction. 

For the other direction, suppose that E is a finite extension of F, 
i.e., that E is a finite dimensional vector space over F. Let (E/F) =n, 
and choose a basis {a1,...,Qn,} of E as an F-vector space. 
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Certainly F(ay,...,@n) C E as each a; € E. On the other hand, 
since a; € E and (E/EF) is finite, we know that a; is algebraic over F, 
for each i. But since {a1,..., Qn} is a basis for E over F, every element 
e of E can be written as ¢ = f17, +---+ fn@y with fi,..., fn © F. 
Now this expression is a linear function of a1,...,@, with coefficients 
in F, so is certainly a rational function of a1,...,@, with coefficients 


in F,andsoE C F(aj,...,Q@,). Hence E = F(qj,..., an). 


Corollary 4.5.7. Let {a1,...,Qn} be a finite set of elements of 
E, each of which is algebraic over F. Let a; have degree dj, i = 
1,...,n. If {dy,...,dn} is relatively prime, then (F(a1,...,Qn)/F) = 
i sae ng tls 


Proof. Since d; = (F(a;)/F), this follows immediately from Corol- 
lary 4.3.13. 


Now we come to a very important notion. 


Definition 4.5.8. Let f(x) € F[z] be a polynomial of degreen > 1, 
f(x) =anxz"+---+a9, and suppose that E is an extension of F such 
that f(z) =an(% — a1)---(@ — ap) in E[a]. Then we say that f(z) 
splits in E[a], and that the extension F(a,,...,@,) of F is a splitting 


field for f(z). .) 
Remark 4.5.9. 


(a) We observe that f(x) splits in E if and only if f(a) has n (not 
necessarily distinct) roots in E. 
(b) An intermediate field B between F and E is a splitting field for 
f (a) if and only if B is the smallest such field in which f(x) splits. 
To see this, note that any field in which f(x) splits must contain 


Q1,---,;Q@n, and hence must contain F(a,,...,@,,), and conversely 
Q4,---;Q@n, and hence F(a1,...,Q@,,), must be contained in any 
such field. © 


Theorem 4.5.10. Let B be a splitting field for f(x) € F[x], a poly- 
nomial of degree n. If f(x) is irreducible, then (B/F) is divisible by 
n. In any case (B/F) < n! 


Proof. Let f(x) have roots a1,...,Q@n. Then F(a) is intermediate 
between F and B, so (B/F) = (B/F(a1)) (F(a1)/F). 

If f(x) is irreducible, we know that (F(a,)/F) = n, and so in this 
case (B/F) is divisible by n. 
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We prove the general inequality by induction on n. It is certainly 
true when f(x) has degree 1. Now suppose it is true for all poly- 
nomials of degree n — 1 over any field, and let f(x) have degree n. 
Consider a root a; of f(x). Then a; is a root of some irreducible 
factor f(x) of some degree n’ < n and so (F(a,)/F) =n’ < n. 
But ay is a root of f(x), so f(x) is divisible by x — a1, and so we 
may write f(x) = (a —.a4)g(x), where g(x) is a polynomial of degree 
n — 1. Note that the coefficients of g(x) are in F(az), ie., g(x) is 
a polynomial of degree n — 1 in F(a,)[z]. But B is also a splitting 
field for g(x) over F(a1), as B = F(aqj,...,@) = F(a1)(a2,...,Qn). 
Then by the inductive hypothesis (B/F(a1)) < (n — 1)! so (B/F) = 
/F(a))(F(a)/B) <n! 


Corollary 4.5.11. Let B be a splitting field for f(x) € Fla], a 
polynomial of degree n. Let f(x) have roots ay,...,Q, in B. Then 
(B/F) = n! if and only if f(x) is irreducible in Fx], and for every 
j 2A, if g;(x) ts defined by f(x) = (x — a1)--- (@ — a5) g(x), then 
gj (x) ts irreducible in F(a1,...,a;)[2]. 


—S 


Proof. The proof of Theorem 4.5.10 shows this stronger result in 
this case. 


Corollary 4.5.12. Let f(x) € F[ax] be a polynomial of degreen > 1. 
Suppose that f(x) = fi(x)--- fe(x) is a factorization of f(x) into 
irreducible polynomials in Fax]. Let fi(x) have degree n;. Let B be a 
splitting field of f(x). Then (B/F) is divisible by n; for each 1, and 
(B/F) < (m1!) --- (nal). 


Proof. The case k = 1 is Theorem 4.5.10. 

Suppose k = 2. Then (B/F) is divisible by both n; and ng, as in 
the proof of that theorem. Also, if a1,...,@,, are the roots of fi (zx), 
and aj,...,a/,, are the roots of fo(a), then 


( /F) = ( (OG phon Gg MY gman dll ) 
eal (Bie. Dag ipa (iis is iy) 
(F(a1,-..,Qn,/F)) 
< (na!) (nr!) 


The general case follows by induction on k. 
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The careful reader will note that in this section, we started out 
not only with a field F but also with a larger field (i.e., an extension) 
EK in which some polynomial f(x) € F{a] had some (or all) roots. But 
how do we know some such field E exists? In particular, how do we 
know that an arbitrary polynomial f(x) € F[z] has a splitting field? 

If you think about it, there was one situation where we started 
just with the field F. It was the situation of Theorem 4.2.4, where 
we began with an irreducible polynomial f(x) € Fa] and produced 
a field where f(x) had a single root. 

So we will now apply that result inductively to show that every 
polynomial has a splitting field. 


Theorem 4.5.13. Let F be a field and let f(x) € Fla] be an arbitrary 
polynomial. Then f(x) has a splitting field E. 


Proof. Let f(x) have degree n. We prove the theorem by induction 
on 7. 

Ifn = 1, then f(x) = a,x + ag has root a = —ao/ai, anda € F, 
so f(x) has splitting field F. 

Now suppose the theorem is true for all polynomials of degree 
n —1, and all fields. Let f(a) have degree n. 

By Theorem 4.2.4 (Kronecker’s theorem) there is a field E; in 
which f(x) has a root a1, so that x — a, is a factor of f(a); more- 
over, Ey = F(a1). Write f(x) = (x — a1)g9(zx), so that g(x) € Ej[z] is 
of degree n — 1. Then by the inductive hypothesis g(x) has a split- 
ting field E, ie., E = Ej(a2,...,Q@,) where a2,...,Q@pn are the (not 
necessarily district) roots of g(x) in E. But then 


BD = iGo os Gi) = FG) ays. q) = Fig of Mm) 


is a splitting field for f(x) over F. 
Then by induction we are done. 


4.6 Examples of field extensions 


In this section, we present a number of examples of field extensions. 


Example 4.6.1. The field Q(\/2). This is a familiar field, obtained 
by adjoining the single element 2 to Q. This element is algebraic 
over Q, with minimal polynomial mj5(x) = x? — 2. Q(V2) is an 
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extension of Q of degree 2, and as a Q-vector space, Q(V2) has basis 


Example 4.6.2. The field Q(V/2, V3), obtained by adjoining the 
two elements V2 and V3 to Q. We claim that this field is an exten- 
sion of Q of degree 4. To see this, note that /2 is algebraic over 
Q with minimal polynomial mj3(x) = x? — 2, and V3 is algebraic 
over Q with minimal polynomial m/3(x) = x? — 3. Thus /2 and 
V3 both have degree 2 over Q, so we see from Corollary 4.3.12 that 
(Q(v2, V3)/Q) < 2-2=4. 

We also know from Corollary 4.3.6 that (Q(V2, V3) /Q) is divis- 
ible by (Q(V/2/Q) = 2. Hence, (Q(V2, V3)/Q) = 2 or 4. 

But — (Q(v2, V3)/Q) = (Q(v2, V3)/Q(v2)) (Q(v2)/Q)__ and 
(Q(V2)/Q)=2, so we just need to show that (Q(V2,V3)/ 
Q(V2))=2. Now Q(V2, V3)= (Q(vV2)(vV3)) so to show that 
(Q(V2, V3)/Q) # 1, in which case it must be 2, we need only show 
that V3 ¢ Q(v2). 

We prove this by contradiction. Suppose that /3 € Q(V2), so 
that /3 = a+ bV2 for some a, b € Q. Then 3 = (a + bV2)" = 
(a? + 2b7) + 2ab/2, so 2ab = 0, and hence a = 0, b = 0. If a= 0 we 
obtain the equation 3 = 2b7, which has no solution with b € Q, and 
if b = 0 we obtain the equation 3 = a?, which has no solution with 
a €Q, so this is impossible. 

Also, Q(/2) has basis {1, 2} as a Q-vector space, and Q(V/3) 
has basis {1, /3} as a Q-vector space, so it follows from Lemma 
4.3.11 that Q(./2, V3) has basis {1-1, /2-1, 1-V3, V2-V3} = 
{1, 2, V3, V6} as a Q-vector space. 

Finally, we observe that Q(V/2), Q(V3), and also Q(6) are inter- 
mediate fields between Q and Q (V2, a/3)s © 


We now present a family of examples that generalizes these two. 


Example 4.6.3. Let T = {a1,...,a:} be a set of integers. For a 
subset S' of T, let Ps be the product of the elements of S. (If S = ¢, 
let Ps = 1). Suppose that T has the property that for no nonempty 
subset S' of Tis Ps a square. For example, this will be the case if the 
elements of T are pairwise relatively prime nonsquares and at most 
one of them is the negative of a square. 
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Let E=Q (4/a, onan a/ Gale We claim that E is an extension of Q 
of degree 2°. We prove this by induction on t. 

For t = 0 this is trivial and we know this for t = 1. Assume it is 
true for any set of t — 1 elements as above and suppose that 7 has 
t elements. Set D = (,/aq,...,,/@i—1). Then (E/Q) = (E/D)(D/Q) 
and by the inductive hypothesis (D/Q) = 2'~!. Thus, we need to 
show (E/D) = 2. Now E = D(,/a) so to show this we need only 
show that \/a; ¢ 

We prove this by contradiction. Suppose ,/a; € D. Then we may 
write /a; = By a1 with x, y EB=Q (./a4, ee (Gi). Squar- 
ing, we ee az = (a? + ay_i1y”) + 2ry,/a—1. Now a, € QC 
xz? + a_ iy? € B, and 2xy € B, so by the inductive hapothiest: 
which implies that Jae-1 e B, Bila must have oe = 0. Then either 
x = 0, in which case a = y?ai-1, a¢a¢_1 = y?a?_, = (yar1)° 
which is impossible by our condition on 7’, or y = O in which case 
Ja; = x € B, contradicting the inductive hypothesis (applied to the 
field Q (,/az,..., Jai2; «/04))- 

We may also argue inductively that has basis {Ps | 
S asubset of T’}, where T’ = {aj,...,a¢-1}, and Q(,/a¢) has basis 
{1, \/ai}, so we see that E has basis {/Ps | S a subset of T}. 

We note that E is a splitting field of the polynomial p(x) = (x? — 
a1)+++ (x7 — a) € Qiz]. 

Finally, there are many fields intermediate between Q and E. For 
example, we have 2’ — 1 quadratic extensions of Q contained in E, 
the fields Q(./Ps) for every nonempty subset S' of T. ?) 


We now turn over attention to a different family of examples. 

We know that the complex number 1 has n complex nth roots. If 
we set C, = exp(27i/n), they are given by ¢*, k =0,...,n —1. We 
thus see that x” —1= (2 —1)(r —G,)-+- (w@ — ¢}). 


Definition 4.6.4. A complex number ¢ is a primitive nth root of 1 
if C is an nth root of 1 but ¢ is not an mth root of 1 for any m <_ n, 
or, equivalently, for any m properly dividing n. The nth cyclotomic 
polynomial ®,,(a) is the polynomial 


[[@-9 


where the product is taken over the primitive nth roots of 1. © 
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We will be investigating the polynomials ®,,(x) in general later 
on. Here we restrict our attention to the case n = p is a prime. 

For p a prime, a pth root ¢ of 1 is either ¢ = 1 or ¢ a primitive 
pth root of 1. Thus, we see 7? — 1 = (a—1)®,(x) and hence ©,(a”) = 
(2? —1)/(2@ —1) =a? 14+ aP 7+... +e4+1. 


Lemma 4.6.5. For p a prime, the pth cyclotomic polynomial ®,(x) 
is irreducible in Q|x]. 


Proof. Since ®,(x) is a monic polynomial with integer coefficients, 
we know that it is irreducible in Q|z] if and only if it is irreducible 
in Z[z]. 

We show that ®,(x) is irreducible by using a well-known trick. 
The polynomial ®,(x) is irreducible if and only if the polynomial 
®,(x + 1) is irreducible. But, by the binomial theorem 


e111 weet < 7 
6,(0+1) = 2 ms Ge : 


Then (7) = 1, ) is divisible by p for 1 < k < _ p, and 


(3 ) = p, so we see that ®,(x + 1) is irreducible by Eisenstein’s 


Criterion (Lemma 3.11.13). 


Example 4.6.6. Let E be the subfield of C that is the splitting 
field of ®,(x) over Q. Then E = Q(G,) and E is an extension of Q of 
degree p—1. (In fact, E = Q(¢k) for any kwith1 < k < p—1.) Asa 


Q-vector space E has basis {1, G,..., aa Note that 0 = ®,(¢,) = 
a -++ +¢G,+1=0 50 we have that ¢,+ --- cet = —1. Then 
we see that E also has basis {¢p,..., pty as a Q-vector space. 


We now look at a couple of special cases of cyclotomic fields. 


Example 4.6.7. Let p = 3. The field Q(¢3) is an extension of Q of 
degree 2. Now ©3(x) = x7 + 4+ 1 is a quadratic, so we may find 
its roots by the quadratic formula. They are (—1 + W—3)/2. Thus, 
¢3 = (-1 + V—83)/2 (where we choose the sign so that /—3 has 
positive imaginary part) and then ¢? = (—1 — V—3)/2 is the other 
root of this polynomial. Thus, we see Q(¢3) = Q(V/—3). .) 
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Example 4.6.8. Let p = 5. The field Q(¢5) is an extension of Q 
of degree 4. We will find ¢5 explicitly. To this end, consider B = 
Q(¢s) OR. Then 4 = (Q(¢5)/Q) = (Q(¢s)/B)(B/Q), and so either 
(B/Q) = 1, in which case B = Q, or (B/Q) = 2, in which case we 
also have (Q(¢;)/B) = 2. We will show the latter case occurs. 

To that end, note that i: =(5,s00=+ es ER. We find @ as 
follows: We compute 6? = (¢5 + ¢3)? = (2+ (3 +2. Now 14+654+ (7+ 
CB +65 =0, s0 +8 =-1-C-G=-1-8. 

Hence, 6? = (—1 — 6) 4+ 2, or 6? +0 —1=0. In other words, @ is 
a root of the quadratic polynomial 2? + 2 — 1. From the quadratic 
formula we see that this polynomial has roots (—1 + /5)/2, so 0 = 
(—1+ V5)/2. (Since ¢ and ¢4 have positive real part, their sum is a 
positive real number.) You can easily check that 6’ = ¢? + ¢3 is also 
a root of this polynomial, so 6’ = (—1 — V/5)/2. Thus we see that 
Q(4) = Q') = Q(v5). 

Now G3 = ae so 6 = (5 + Ge which gives the equation ¢? — 
6¢, +1 = 0. In other words, Cs is a root of the quadratic x? — 
0x + 1 € B[z] (and the other root of this quadratic is ¢?). Again we 
apply the quadratic formula to find that this polynomial has roots 
[((-1+ V5) + V5 —2V5]/2. 


Since ¢; has positive imaginary part, we see that 


_ (-14+ V5) + V-5 — 2Vv5 
one ae 


We observe, as expected, that (Q(¢5)/B) = 2, as 


Q(¢s) = (y/-5-2v3) and —5—2V5eB. © 


We turn to a third family of examples. To prepare for them, we 
have the following result, which we state in complete generality. 


Lemma 4.6.9 (Abel). Let F be a field and let a € F. If p is a prime 
anda is not a pth power in F, then the polynomial x” —a is irreducible 


Proof. We shall show that if 2? — a is not irreducible, then a is a 
pth power in F. So suppose 7? —a = f(x)g(x) in F[z], where f(x) has 
degree d and g(x) has degree p—d, for some d with 1 < d< p-—1l. 
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Let E be a splitting field of the polynomial x? — a. Then we see 
that E = F(¢, a) where ¢? = 1 and a? =a, so x? —a = (x — a)(x — 
Ca) ++ (0 - 1a). 

(Although this is not strictly necessary for the proof, we should 
observe that if char (F) # p, then ¢ # 1, as if ¢ = 1, then we 
would have x? —1 = (a—1)? = a? —pa?—!---, which is not the case, 
as p # 0 in F. On the other hand, if char(F) = p then ¢ = 1, as in 
this case we do have x? — 1 = (x — 1)? by Lemma 4.1.14.) 

Thus, 


f(x)g(x) = 2? —a = (w@—a)(e—Ca)--- (ew — GPa). 
But we have unique factorization in E[z], so f(x) must be the product 
of d of these terms and g(x) must be the product of p — d of these 
terms. But the constant term of f(x) is + ¢*a% for some integer k, 
so B = CKa@ € F. Now d and p are relatively prime, so there is a 
positive integer e with de = 1 (mod p), i.e., de = mp +1, so 
pe = (far = Chege = ge i ~ Cha cF 
and then 7 = C**a € F. But 


yP = (Cha)? = CPRgP = (1)F’aP =a ck 


and a is a pth power in F. 


Example 4.6.10. Let p be a prime and let n € Q be a rational 
number that is not a pth power in Q. Let f(x) = 2? —n € Q[z] and 
let E C C be the splitting field of f(x). Then E = Q(G, #/n). (Note 
that f(a) splits in this field, and that any field in which f(x) splits 
must contain ¢, and ?/n, so this field is indeed the splitting field). 

Now E is the composite E = Q(¢,)Q(¥#/n). Now ¢, is a root of 
the cyclotomic polynomial ®,(x), of degree p— 1, and we showed in 
Lemma 4.6.5 that ®,(a) is irreducible. Hence, (Q(¢,)/Q) = p— 1. 
Also, ?/n is a root of the polynomial x? — n, of degree n, and 
we showed in Lemma 4.6.9 that this polynomial is irreducible. 
Hence (Q(%/n)/Q) = p. Then (E/Q) = p(p — 1) by Corollary 
4.3.13. We know that Q(G,) has basis is Gil | (or basis 
Vonage Vy) as a Q-vector space, and that Q(¥/n) has basis {1, 
en, (wn)*,...,(%/n)P-1} as a Q-vector space, so E has basis 
(i! (4 — 0) agi 29 —Onsp— 1) 1m! | 4 = 
dicey — 1,7 = Ope2239— 1h. ?) 
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Now we return to reconsider some of our earlier examples. 


Example 4.6.11. Let us consider the polynomial f(a) = x? + 7x + 
1 € Q/z], as we did in Example 4.4.10. From elementary calculus 
we see that f(a) has exactly one real root, which we shall call a4. 
Let B = Q(a1). Then (B/Q) = 3, since f(x) is irreducible in Q[z], 
as we have already seen. But we note that B is not a splitting field 
of f(a) over Q, as B C R and, as we have just remarked, only one 
of the roots of f(x) is in R. Let E D B be a splitting field of f(x), 
so that E = Q(aj, a2, a3) where aj, a2, and ag are the roots of 
f(z) in E. Then (E/Q) < 3! =6 by Theorem 4.5.10, and (E/Q) is 
divisible by (B/Q), so (E/Q) = 6. But then (E/B) = 2. Now, setting 
a = a1, we see that f(x) has a factor of x —a in Biz], ie., f(z) = 
(a — a)g(x) € Blix] for some polynomial g(x) € B[z], and simply 
dividing polynomials we see that g(2) = 2? + ax + (7+ a7) € Biz], 
and g() is an irreducible quadratic in B|z]. Indeed, we would obtain 
the exact same results, and formula, if we set @ = ag, or a= az. 


Example 4.6.12. Let us consider the polynomial f(x) = ++nx?+ 
1 € Qjz], for n an integer, n # +2, generalizing Example 4.4.11. 
Let a@ be a root of this polynomial, and let B = Q(a). Then (B/Q) = 
4, since f(x) is irreducible in Qj], as we will show below. Note 
that, since all of the powers of x in f(a) are even, —a is also a 
root of f(x). Also notice that since this polynomial is palindromic, 
i.e., the coefficients are the same left-to-right as they are right-to-left 
(1,0,n,0,1), 1/a@ is a root, and then —1/qa is a root as well. Thus, 
we see that B is a splitting field of f(x), that f(x) = (x —a)(x — 
(—a))(a — 1/a)(a — (—1/a)) € Bla], and that if a,,...,a4 are the 
roots of f(z) in a splitting field E = Q(a1, a2, a3, a4), then in fact 
b= QA(a, a, 1/a, —1/a) _ Q(a@) = B. 

We now show our claim that f(x) is irreducible in Q|z]. The only 
possible rational roots of f(x) are +1, by Lemma 3.11.12, and neither 
of these is a root, so f(x) cannot have a linear factor in Q[z]. Thus 
if f(x) is not irreducible, it must be a product of two quadratics 
in Q[z]. We cannot have a factor (x — a)(x — (—a)) = x? — a? as 
then we would have to have a? = +1, which we have excluded. We 
cannot have a factor (x — a)(x — 1/a) = 2? —(a+1/a) +1, as then 
a+1/a=k for some integer k, and then a? —ka+1 = 0, with roots 
(k + Vk? —4)/2, and ifk # +2, k? — 4 is never a perfect square. 
(If k = +2, then a = +1 again.) Similarly, we cannot have a factor 
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(2 —a)(x — (—1/a)) = x? —(a—1/a)x—1 as this gives a—1/a = k, 
x? — ka — 1 = 0 with roots (k + Vk? + 4)/2, and if k 4 0, k? +4 is 
never a perfect square. (If k = 0, then a = +1 again.) 

Finally, in the cases we have excluded, f(x) is reducible: 


e409? = (e441)? and g°=8e7-41 = (271) o) 


Let us now see how to obtain some infinite algebraic extensions. 


Example 4.6.13. Let pi, po, p3,... , be distinct primes, and let E = 
QU/P1, /P2; \/P3,.--). Then for any k, E > By = Q(,/p1,.--, Dk) 
and by Example 4.6.3, (B,/Q) = 2*. Thus (E/Q) > 2* for every k, 
so (E/Q) = oo. But note also that, considering the bases we wrote 
down in that example, every element a of E must have an expression 
that involves only finitely many of \/p1, \/p2,.--, so is in the finite 
extension B;, for some k, and hence is algebraic. © 


Example 4.6.14. Let p1, po, p3,..., be distinct primes, and let B = 
Q(*/2, ”/2, */2,...). Then for any k, E > By = Q(*/2,..., */2). 
By Lemma 4.6.9, Q(%/2/Q) = p; for each i, so by Corollary 4.3.13, 
(B./Q) = pi,.--,; pr. Thus (E/Q) > pi,..., pr for every k, so again 
(E/Q) = oo. Also, by the same argument as in the preceding example, 
every element @ of E is algebraic. 


Now let us return to an example of finite fields. 


Example 4.6.15. Let us consider the polynomial f(x) = 2°? + a+ 
6 € F7|z], as we did in Example 4.4.12. Let a be a root of this 
polynomial and consider F7/a]. Then (F7(a)/F7) = 3, since f(x) is 
irreducible in, F7[2], as we have seen. Once again we may ask whether 
7(a) is a splitting field of f(a). The answer turns out to be yes, as 
f(x) has the roots aj = a, ag = 1+ 5a”, and a3 = 6 + 6a + 2a”, 
all in F7(a). Thus f(x) = (a — a1)(x — a2)(@ — a3) in F7(a), and 
7(a1, Q2, a3) = F7(a). > 


Remark 4.6.16. Once you are given a2 and a3, you may check by 
direct substitution that f(a2) = 0 and f(ag) = 0. But of course you 
may-indeed you should-ask, how we know that f(x) splits in F7(q), 
and, given that, how to find the other roots of f(x) in F7(a). We will 
be answering these questions below. © 
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4.7 Isomorphisms of fields 


Before getting to work in this section, let us reflect on some of our 
examples. 

Let us begin by considering E = Q[z]/(p(x)) where p(x) € Qjz] 
is the irreducible polynomial p(x) = x? — 2. Then, as we have seen, 
E = Q(a), ie., E is obtained from Q by adjoining an element a@ with 
a” = 2. We have also looked at the field Q(V/2), obtained from Q by 
adjoining 2, an element with (/2)? = 2. But we could also have 
obtained Q(V2) by adjoining a different element, —/2, which also 
satisfies (—\/2)? = 2. Now somehow these should be algebraically 
“all the same”, that is, somehow, from a purely algebraic point of 
view, we should not be able to distinguish between them. 

Similarly we could consider E = Q{z]/(p(x)) where p(x) € Q|z] is 
the irreducible polynomial p(x) = 23 —2. Then E = Q(a) is obtained 
from Q by adjoining an element a with a® = 2. But we also have the 
field Q(¥/2), obtained by adjoining W/2, an element with (¥/2)? = 2, 
the field Q(¢3/2), obtained by adjoining ¢3¥/2, where (¢3/2)? = 
2, and the field Q(¢?¥/2), obtained by adjoining ¢?/2, where also 
(¢? \/2)? = 2, and again they should somehow be algebraically “all 
the same”. 

Moreover, we have the notion of a splitting field of a polynomial 
f(x) € F[z], and we constructed such a splitting field E. But again, 
somehow all splitting fields of f(x) should somehow be algebraically 
“the same”. 

This intuition turns out to absolutely correct, once we make the 
notion of “the same” precise. And the precise way to say that is that 
“the same” means isomorphic. Thus we are led to study the notion of 
isomorphism of fields, which we first introduced in Definition 4.1.9. 

It turns out that the notion of field isomorphism is a crucially 
important one, one that lies at the very heart of Galois theory, which 
is our ultimate goal in this chapter. 

So, with a view toward future developments, we will be stating 
(and proving) our results in a more general form than is just needed 
to make these intuitions precise. 

Before proceeding, there is a point we need to stress to the reader, 
one that appears throughout mathematics, and one whose impor- 
tance cannot be overstated. Suppose we have two objects, of what- 
ever sort, X and Y, and we have an isomorphism, of whatever sort, 
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f: X — Y. Then X and Y are indeed isomorphic, but that is not 
(repeat, not) (repeat again, not) to say they are identical. True, the 
isomorphism f may give a way of identifying X with Y, but that 
identification depends on the isomorphism f, and different isomor- 
phisms will give different identifications. 

Before constructing field isomorphisms, let us see an essential 
restriction on how they must behave. 


Lemma 4.7.1. Let Fy and F2 be fields and let yo: Fy — Fo be a 
field isomorphism. Let E, be an extension of Fy and let Eg be an 
extension of Fo. Suppose that yp: Ey > Eg is a field isomorphism 
extending yo. Let 3 be an arbitrary element of Ey. If pi(x) € Fy[z] 
is any polynomial having B as a root, and po(x) = yo(p1(x)) € Fela], 
then y = y(8) must be a root of the polynomial po(x). 


In particular, if E, and Eg are extensions of F, y: Ey > Eo is an 
isomorphism that is the identity on F (perhaps Eg = Ey and y is an 
automorphism of E; that is the identity on F), 8 is an element of Ey, 
and p(x) € Flax] is a polynomial having 6 as a root, then y = ~(f) 
must be a root of p(x) in Eg. 


Proof. Let p1(%) = anz” +---+ ag € Fi[z], in which case po(x) = 
~o(an)x” +-+-+ yo(ao) € Fo[x]. We then have 


p2(y) = p2(e(B)) = vo(an)(y(B))” +--+ + vo(ao) 
= y(an)(y(B))” +--+ + v(ao) 
= p(anB”) +--+ + vag) 
= (a@nB”" +--+ + ao) 
(p1(8)) = p(0) = 0. 


= 


We would like to prove a converse of this lemma. But, thinking 
about it, we can see two restrictions. If p(x) were not irreducible, but 
instead, for example, pi(x) = qi(x)ri(x) for distinct irreducible poly- 
nomials qi(a) and 71(a), we could not have an isomorphism taking a 
root of qi(x) to a root of r1(x). Also, if F1(G) C Ey and Fo(y) C Es, 
we could expect to know y: F,(3) — Fe(y), but could not expect to 
say anything at all about » (not even whether it exists) on all of E; 
or all of Ey. But these restrictions turn out to be the only ones. 
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Lemma 4.7.2. Let F; and Fo be fields and let yo: Fy > Fo be a field 
isomorphism. Let p(x) € Fy[x] be an irreducible polynomial and let 
p2(x) = go(pi(x)) € Fala]. If B is any root of pi(x) in an extension 
field E; of Fi, and y is any root of po(x) in an extension field Eg of 
Fo, then there is a unique isomorphism yp: F1(8) > Fo(y) extending 
yo with p(B) = +. 


In particular, if Ey and E2 are extensions of F, perhaps Ey = Ey, 
p(x) € Fla] ts an irreducible polynomial, 8 is any root of p(x) in Ey 
and y is any root of p(x) in Eg, then there is a unique isomorphism 
yp: F(B) > F(q) that is the identity on F with p(B) = 7. 


Proof. We have a ring homomorphism ¢: F,[z] — F,(@) given 
by e1(f(x)) = f(8). Note that e;(2) = 6. The homomorphism €, is 
onto with kernel 1, = {f(x) | f(G) = 0} = (pi(z)), the principal 
ideal of Fj |] generated by pi(x), and so we obtain an isomorphism 
é1: Fy[z]/, > Fi(8) with €:(x) = 6. Similarly, if Ig = (po(x)), the 
principal ideal of F2[z] generated by p2(x), we obtain an isomorphism 
€9: Fo[x]/Ig > Fo(y) with €:(x) = 7. 
Now the isomorphism yo: F, — F2 extends to an isomorphism 
yo: Fi[z] > Fo[x] with yo(x) = 2, and if p(x) = yo(pi(z)), this 
induces an isomorphism Go: Fy[x]/Iy > Fe[x]/To. 

Then the isomorphism ¢ is given by yp = €(%o)(€1)~+, and 
(8) = 7. 

Furthermore, y is unique, as it is determined by its values on 
elements of F; and on the element £. 


Now we turn our attention to isomorphisms, and automorphisms, 
of splitting fields. 


Lemma 4.7.3. Let F, and Fo be fields and let yo: Fy — Fo be an 
isomorphism. Let p(x) € Fy[a] be an arbitrary polynomial and let 
p2(x) = yo(pi(x)) € Fela]. If Ey is any splitting field of pi(x) and Eg 
is any splitting field of p2(x), then there is an isomorphism yp: Ey > 
Eo extending Yo. 


In particular, if F is a field and E, and Eg are any two splitting 
fields of p(x) € Fla], then Ey, and Eg are isomorphic via an isomor- 
phism yp that restricts to the identity on F. 
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Proof. By induction on n = deg pj; (x) = deg po(x). 
Ifn = 1, Ey = Fy, Eo = Fo, and we let y = yo. 
Assume now the lemma is true for all polynomials of degree less 

that n, and all fields, and let p;(x) have degree n. 

Let (1,...,@8n be the (not necessarily distinct) roots of p;(x) in 
E1, so that Ey = F1((1,...,8n). Let y1,...,%m be the (not necessarily 
distinct) roots of p(x) in Eg, so that Ep = Fo(%1,..-,9n)- 

Now (3; is a root of some irreducible factor qi(x) of pi(x) in Fy [2]. 
After possible reordering, we may assume that 7, is a root of the 
irreducible factor g2(z) = y(qi(x)) of po(x) in Fo[z]. Then by Lemma 
4.7.2 there is an isomorphism ~: F1(61) > Fe(71) extending Yo. 

Observe that hy = Fi (C1, mae En) = 1(Po, aoa Bu) where {= 
1(B1), and similarly 2 = Fo(m1, ae Yn) = 2(Y2; sess Yn) where 

2 = Fo(y1). Note that pi(x) is divisible by x — 6; in By[x] and 

p2(x) is divisible by x — y, in Bg[x]. Write pi (x) = (# — 61)ri(x) and 

po(x) = (w@—741)r2(xz). We observe that ro(x) = v(r1(z)), polynomials 

of degree n — 1. 

Now E, is a splitting field of r|(a) (regarded as an extension of 

1) and Eg is a splitting field of ro(x) (regarded as an extension 
of Bz). Thus, by the inductive hypothesis, there is an isomorphism 
yp: Ey > E» extending w, and thus extending yo as well. 


We want to emphasize that the isomorphism y constructed in 
Lemma 4.7.3 is almost never unique-quite the opposite! We have the 
following result, which, for simplicity, we state in the most important 
special case. 


Corollary 4.7.4. Let p(x) € Flax] be an irreducible polynomial, and 
let E be a splitting field of p(x). Then for any two roots a, and az of 
p(x) in E, there is an automorphism py: E > E extending the identity 
on F with y(ay) = ae. 


Proof. We showed this in the proof of Lemma 4.7.3. In the situ- 
ation here, {(1,...,8n} = {71,---;%n} but we may choose 3; = a, 
and 1, = ag. 


Remark 4.7.5. The isomorphism y in Corollary 4.7.4 may or may 
not be unique. Suppose that p(x) has roots aj,...,Q@, in E. Then 
EK = F(ay,...,Q,). If it happens that E = F(a), then ¢ is unique. 
But otherwise, y is not, as we also see from the proof of Lemma 
4.7.3. ©) 
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4.8 The Galois group: Definition and examples 


Our goal is the fundamental theorem of Galois theory. But we have 
come for enough to be able to define the Galois group, to compute 
it in some cases of field extensions that we have already seen, and to 
draw some general conclusions about it. 

Here is the basic definition. 


Definition 4.8.1. Let E be an extension of the field F. The Galois 
group Gal(E/F) is the group 


Gal(E/F) = {automorphisms y: E > E | ¢ is the identity on F}, 


a group under composition. © 


Example 4.8.2. Let a € Q not be a perfect square and let E = 
Q(./a), an extension of Q of degree 2. We observe that E is a splitting 
field of the irreducible quadratic p(x) = 2? — a € Qa]. By Lemma 
4.7.1 we observe that any automorphism y of E must have y(./a) = 
+,/a, and from Lemma 4.7.2 that either of these is possible. Hence 


G = Gal(E/Q) = {20,1} 


with o9 = id (the identity) and 0; determined by o1(./a) = —,/a. 
We observe that G is a group of order 2, isomorphic to Zo. © 


Example 4.8.3. Let T = {a1,...,a:} be a set of integers as in 
Example 4.6.3 and let E = Q(,/aj,...,,/a;) as in that example. We 
saw there that E is an extension of Q of degree 2’. We observe that E 
is a splitting field of the polynomial p(x) = (x? —a,)(a?—az) ... (x? — 
az) € Q|a]. By Lemma 4.7.1 we observe that any automorphism y 
of E must have y(,/aj) = &./ai,¢; = +1, for each 7. Thus, we see 
that G = Gal(E/Q) will have order at most 2° and will have order 
exactly 2° if and only if we can choose all these signs independently. 
We show we can do so, by induction on f. 

In case t = 1 this is just the previous example. Now suppose it is 
true for any set T’ = {a ,...,a¢-1} and consider T = {aj,..., a}. 
Let D = Q(,/ai,...,,/ae—1). Then (D/Q) = 2°! and (E/Q) = 2', 
so (E/D) = 2. Clearly E = D(,/a¢) so ay has degree 2 over D, and so 
the polynomial x? — a; is irreducible in D[z] (not just in Q[z]). Now 
let €, = +1,...,€ = +1 be any collection of signs. By the induction 
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hypothesis there is an isomorphism yo: D — D with yo(./ai) = 
ej,/aj for i= 1,...,t—1. But then by Lemma 4.7.2 there is a (unique) 
isomorphism y: E — E extending yo and with p(./ar) = €¢,/a for 


each choice e; — +1 or e = —1. Then by induction we are done. Thus, 
G has order 2°. 
We can concretely describe G. For each i = 1,...,t, let oj: E> E 


be the automorphism with o;(\/ai) = —/a; and o;(./a;) = \/aj for 
j # 1%. Then G is the group generated by {o1,...,04}. We observe 
that o? = id for each i, and that ojo; = 0;0; for each i,j, and so G 
is isomorphic to the group (Z2)’. o) 


We now return to Example 4.6.6 and investigate it more deeply. 


Example 4.8.4. Let ®,(x) be the pth cyclotomic polynomial, p a 
prime, and let E C C be a splitting field of ®,(x). Then E = Q(¢,) 
and, since ®,(x) is an irreducible polynomial of degree p—1, E is an 
extension of Q of degree p—1. Now ©,(z) has roots G,..., 5 ‘in E. 
Then, by Lemma 4.7.2 (or Lemma 4.7.3) there is an automorphism o; 
of E with o; (Gp) = ch for each k = 1,...,p—1, and, since E = Q(G,), 
such an automorphism is unique. (Compare Remark 4.7.4.) Thus, 
G = Gal(E/Q) is a group of order p — 1. To further investigate the 
structure of G, note that 


eonlG) =a) = GG)" = (GY =o 


and so we see that G is isomorphic to the multiplicative group Z> (a 
cyclic group of order p— 1). © 


We shall continue to look at examples of Galois groups of exten- 
sions of Q, but will do so in a different order than in Section 4.6, in 
order to work our way up from easier to harder. 


Example 4.8.5. Let us look at a splitting field E of a polynomial 
p(x) = a4 + nx? +1 € Qa], n an integer, n 4 +2, as in Example 
4.6.12. We observed there that p(x) is irreducible. If we let Q(a) be 
a field obtained by adjoining a root a of p(x) to Q, we observed that 
p(x) splits in Q(a)|z]; to be precise 


p(x) = (@ — a)(a — (—a))(@ — 1/a)(@ — (—1/a)) in Qa) [a]. 


Thus, E = Q(a, —a,1/a,—1/a) = Q(a), and E is an extension of Q 
of degree 4. Once again, by Lemma 4.7.2, there are automorphisms 


Field Theory 311 


y of E with y(a) = a, y(a) = —a, (a) = 1/a, and y(a) = —1/a, 
and once again, since E = Q(a), these automorphisms are unique. 
Thus, G = Gal(E/Q) is a group of order 4, and it is easy to check 
that it is isomorphic to Z2 @ Zg. © 


Example 4.8.6. Let us consider a splitting field E of the irreducible 
polynomial p(x) = «3 + 7x +1 € Q{z] of Example 4.6.10. As we saw 
in that example, p() has three distinct roots, which we shall call 
a, 0, and 7, in E, and E is an extension of Q of degree 6. By Lemma 
4.7.1 any o € Gal(E/Q) must permute a, 3, and 4. 

Let B = Q(a). Then (B/Q) = 3 as p(x) is irreducible. Then 
p(x) = (w—a)g(x) in B[ax] and g(x) must be an irreducible quadratic 
in B[z], as otherwise g(x) would split in B/z] and we would have 
EK = B. Then by Lemma 4.7.2 we have an automorphism o of E 
ectiidine the identity o9 on B (where of course oo(a) = a) with 

o(8) = 8 and o(7) = ¥ (of course, this automorphism of E is just the 
identity) and an maar o of E extending o9 with o(8) = y 
and o(7) = £. 

Now a and £ are both roots of the irreducible polynomial p(x), 
so again by Lemma 4.7.2 we have an isomorphism a9: Q(a) > Q() 
with o9(a) = 6, and once again by Lemma 4.7.2 we have an auto- 
morphism o of E extending 09 with o(3) = a and o(y) = 7, and also 
an automorphism o of E extending oo with o(8) = y and o(7) =a. 

By exactly the same logic we obtain automorphisms o of E with 
o(a) = 7, o(8) = B, and o(7) = a, and with o(a) = 7, o(8) = a, 
and o(7) = P. 

Thus we see that Gal(E/Q) is isomorphic to the symmetric group 
S3 acting as the full group of permutations of {a, 6,y}. We observe 
that this group has order 6. © 


Example 4.8.7. Let p be a prime and let n € Q be a rational 
number that is not a pth power in Q. Let f(x) = x? — n and let 
= Q(G, ¢/n) as in Example 4.6.10. For simplicity, set a = ¢/n. 
As we observed there, f(x) is irreducible in Q/z] and E is a split- 
ting field of f(x). We note that E has subfields B = Q(¢,) and 
= Q(a), and that (B/Q) = p—1 as the cyclotomic polynomial 
®,(x), of degree p — 1, is irreducible in Q[z] by Lemma 4.6.5, and 
also that (D/Q) = pas the polynomial f(x) = x? —n is irreducible in 
Qa] by Lemma 4.6.9. Then, by Lemma 4.4.14, ®,(x) is irreducible 
in D[x] and f() is irreducible in B[z]. 
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Thus, by Lemma 4.7.2, there is an element o € Gal(E/B) with 
o(a) = Gpa; regarded as an element of Gal(E/Q) o is defined by 
o(a) = Ga, o(Gp) = Gp. Note that o is an element of order p. 

Also, by Lemma 4.7.2, there is an element 7 € Gal(E/D) with 
T(¢p) = ¢, where r is a primitive root (mod p); regarded as an 
element of Gal(E/Q) 7 is defined by 7(¢p) = G, T(a) = a. Note that 
T is an element of order p — 1. 

Now since E = Q(¢p, a), an element of E is determined by its 
action on ¢, and a. We compute: 


aTije=o Fa)=o a) = Ca 


oT (Gp) =o (7) =O (G)=G 


and 

To(a@) = T(a(a)) = T(Gpa) = Gra 

TO(Cp) = T(a(G)) = T (Cp) = a 
so we see that o'r = Tao € Gal(E/Q), ie., that to7~! = o” in 
Gal(E/Q). 


Finally, we note from Lemma 4.7.1 that any automorphism of E 
fixing Q must take a root of ®,(x) to another root of @,(«), giving 
p—1 possibilities, and also take any automorphism of E fixing Q must 
take a root of f(x) to another root of f(x), giving p possibilities, so 
the order of Gal(E/Q) is at most p(p — 1), and exactly p(p — 1) if 
these choices can be made independently. But the group generated 
by o and 7 has order p(p — 1). Thus, we see that 


Gal( E/Q) = (0,7 | oP = ig = 1,tor—! = a”) 


a nonabelian group of order p(p—1), and that for any 7 with 1 <j < 
p—1and any k with 0 < j < p—1, there is a unique p € Gal(E/Q) 
with p(x) = ¢*a and p(¢,) = G. (As an abstract group, this group 
is independent of the choice of primitive root r; changing r amounts 
to changing generators.) } 


We conclude this section with a general construction for finite 
fields, and apply it to an example. 

Recall that if E is a field of characteristic p, we introduced the 
Frobenius endomorphism ®: E + E given by ®(e) = &€? for every 
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e € E in Definition 4.1.17, and that if E is finite then ® is an 
automorphism. 


Lemma 4.8.8. Let p be a prime and let E be a field of p” elements. 
Then ®: E > E is an automorphism of order n. 


Proof. For any positive integer i, ®’ is given by ®'(e) = cP". If 6! 


is the identity, then ®*(e) = € for every «¢ in E, ice., & —€=0 for 


every € € E, so the polynomial f(x) = 2?’ — x has p” roots. But this 
is a polynomial of degree p’, so has at most p’ roots, so if i < n & 
cannot be the identity. 

On the other hand, if i = n, *(e) = € is the equation €?” —¢ = 
e(e?"-! — 1) = 0. If e = 0 this is certainly true. If ¢ 4 0 then e is an 
element of the multiplicative group E* of order p” — 1, so € has order 
dividing p” — 1, and hence ¢?"~! = 1 and then ¢€?"~' — 1 = 0. Hence 
®"(e) =e for every € € E. 


Corollary 4.8.9. Let p be a prime and let f(x) € F,[z] be an 
irreducible polynomial of degree n. Let E = F,(a) be obtained by 
adjoining a root a of f(x) to Fy. Then f(x) has the distinct roots 
a, ®(a),...,®"-'(a) in E, so that E is a splitting field of f(x). 
Furthermore, Gal(E/F,) is a cyclic group of order n generated by ®. 


Proof. We have observed that E always has the Frobenius auto- 
morphism. Then by Lemma 4.7.1, ®(a) is a root of f(x) for each i. 
We cannot have ®'(a@) = a for any 0 < i <n as any automorphism 
of E is determined by its action on a, and that would give ®’ = id 
for 7 <n, which is impossible by Lemma 4.8.8. Moreover, we cannot 
have ®J(a~) = ®*(a) for any 0 < i < j < nas that would given 
®/~*(a@) = a which is similarly impossible. Thus, we see that f(z) 
has the n distinct roots a, ®(a),...,®"~!(a) in E, so f(x) splits in 
KE. And again by Lemma 4.7.1, if o is any automorphism of E, we 
must have o(a) a root of f(x) in E, i.e., c(a) = &'(a) for some i, in 
which case ¢ = ®°. 


Example 4.8.10. Let f(z) = 23 +2+6 € Fy[2] as in Example 
4.6.15. As we saw there, this is an irreducible cubic in F7[z]. Let 
KE = F(a) be a field obtained by adjoining a root a of f(x) to 
‘7. Then (E/F7(a)) = 3. But as we have just seen from Corollary 
4.8.9, in this situation E is a splitting field of f(x). Furthermore, 
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f(x) has the roots a, ®(a) = a’, and ®?(a) = (a’)’ in E. We 
compute (as we did in Example 4. 4, 12) and we find that a’ = 1+ 
5a’, and (a’)’ = (1+ 507)’ = 6+ 6a + 2a?, as we claimed in 
Example 4.6.15. v) 


Remark 4.8.11. We call the reader’s attention to the fact that in 
every example in this section, where E was the splitting field of a 
polynomial f(x) € F[z], and f(a) had distinct roots in E (which is 
often, but not always, automatic) we had the equality |Gal(E/F)| = 
(E/F). As we shall see, this was no accident! 


4.9 Normal, separable, and Galois extensions 


In this section, we introduce the key notion of a Galois extension E 
of a field F. Then we are immediately faced with the question of how 
to decide when an extension is Galois, and we answer that question 
as well. 


Definition 4.9.1. Let E be a field and let G be a group of automor- 
phisms of E. Then F = Fix(G) is the subfield of E given by 


Fix(G) = {a € E| o(a) =a for every o € G}. .) 
Remark 4.9.2. It is easy to check that Fix(G) is indeed a field, as for 
any 0 € G we have o(0) = 0,0(1) = 1,0(a7~') = o(a)7!, (a4 8) = 
o(a) + 0(8), and o(aZ) = o(a)o(@) for a, 6 € E. 


Definition 4.9.3. Let E be an extension of F. Then E is a Galois 
extension of F if 


Fix(Gal(E/F)) = F. 0) 


Remark 4.9.4. By the definition of the Galois group, every ele- 
ment o of the Galois group fixes F. Thus we automatically have 
Fix(Gal(E/F)) > F, and so E is Galois exactly when this fixed field 
is as small as possible, i.e., equal to F. © 


Our criterion for E to be Galois is in terms of two other properties 
of field extensions. 


Definition 4.9.5. An extension E of F is normal if every irreducible 
polynomial p(x) € Fiz] that has a root in E splits in E[z]. .) 
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Remark 4.9.6. We see that this definition is equivalent to: An 
extension E of F is normal if for every a € E, its minimal polynomial 
M(x) € Fiz] splits in E. ©) 


kb 


Remark 4.9.7. There are plenty of extensions that are normal, but 
there are plenty of extensions that are not. For example, let p be 
an odd prime, n € Q not a pth power, and consider the polynomial 
f(z) = x? —n € Q|a]. As we have seen in Lemma 4.6.9, f(z) is 
irreducible in Q|z]. Let E = Q( #/n). Then E contains one pth root of 
n, but no others, so f(x) does not split in E[z] and E is not a normal 
extension of Q. © 


Definition 4.9.8. An irreducible polynomial p(x) € F[z] is separable 
if all of the roots of p(x) in some, or equivalently every, splitting field 
of p(x) are distinct. 

An arbitrary polynomial p(x) € F[z] is separable if all of its irre- 
ducible factors are separable. 

An extension E of F is separable if every element of E is a root of 
a separable polynomial. © 


Remark 4.9.9. We see that the last part of this definition is equiv- 
alent to: An extension E of F is separable if for every a € E, its min- 


imal polynomial m,(x) has distinct roots in some, or every, splitting 
field. v) 


Remark 4.9.10. There are plenty of extensions that are separable, 
but extensions that are not are much “rarer”. In fact, for many fields 
F every extension of F is separable. We do not want to interrupt our 
main line of argument now, but we will return to this point at the 
end of this section. © 


Remark 4.9.11. We see that an extension E of F is normal and 
separable if for every a € E, its minimal polynomial splits into a 
product of distinct linear factors in E[z]. .) 


Here is our criterion for an extension E of F to be Galois. 


Theorem 4.9.12. Let E be a finite extension of F. The following 
are equivalent: 


(1) E is a Galois extension of F. 
(2) E ts a normal and separable extension of F. 
(3) E is a splitting field of a separable polynomial p(x) € Fiz}. 
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Proof. (1) = (2): Let E be a Galois extension of F. Let a € E 
be arbitrary, and let ay = a,a2,...,Q@py be the distinct elements of 
{o(a) | o € Gal(E/F)}. Let m(x) = (a — ay)...(x — ap). Then, 
since Gal(E/F) permutes {a1,...,Qn}, o(m(x)) = m(a) for every 
o € Gal(E/F). Writing m(r) = 2” + an_y2""! +--+ +9, we have 
that o(a;) = a; for each i = 0,...,n—1. Since E is a Galois extension 
of F, this implies that a; € F for each i, and so m(x) € F(z]. Thus 
m(a) is a separable polynomial (as its roots are distinct) that splits 
in E[z]. 

(2) = (3): Let {e1,...,¢¢}, be a vector space basis for E over F. 
For each i = 1,...,e, let m;(x) € F{a] be an irreducible polynomial 
having €; as a root. Then each m;(a) is separable (as E is a separable 
extension of F) and splits in E (as it has the root e; € E and E is 
a normal extension of F). Let p(a#) = m4(x) ... me(x). Then p(z) is 
a separable polynomial that splits in E. If B C E is a splitting field 
for p(x), then e; € B for each i, so B D F(e,,...,€-) = E and thus 
= M 


(3) = (1). Let n = (E/F). We prove this by complete induction 
on n, and all fields. 
If n = 1 the claim is trivial. (F is certainly a Galois extension of 
.) Suppose it is true whenever (E/F) = n’ < n and let (E/F) = n. 
Let p(x) have roots aj,...,a, in E. Some a; ¢ F as otherwise E = 
. Renumbering if necessary, we may assume aq, ¢ F. For simplicity, 
let a = a1. Let B= F(a). 
Then E is a splitting field of p(x) € Biz] and (E/B) < n, 
so by the inductive hypothesis E is a Galois extension of B, i.e., 
= Fix(Gal(E/B)). Since Gal(E/B) is a subgroup of Gal(E/F) we 
certainly have that Fix(Gal(E/F)) = D C B. We wish to show that 


Let ma(x) € Fla] be the minimum polynomial of a over F, and 
let Ma(x) € D[z] be the minimum polynomial of a over D. Since 
C B = F(a), we see that D(a) = F(a), and so 


deg q(x) = (D(a)/D) = (F(a)/D) < (F(a)/F) = deg ma(x), 


with equality if and only if D=F. 

We show that a(x) = mq(x), thereby proving this equality. Let 
Ma(x) have distinct roots a1,...,Qm in E. (As mq(z) is irreducible in 
F [a], it must divide p(x), which is a separable polynomial, so the roots 
of mq(x) in E must be distinct.) Then ma(z) = (x1)... (4@—-Gm). 
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Now maq(x) divides mq(x) in D[z], hence in E[z], so the roots of 
M(x) must be a subset of {a1,...,Qm}. We claim that every a; is 
a root of M(x). 

Let Eo be a splitting field of ma(x) in E. Since ay and a; are two 
roots of the irreducible polynomial m,(x), by Lemma 4.7.2 there is 
an isomorphism o;: F(a,) > F(a;) with o;(a,) = a;, and oj|p = id. 
Then by Lemma 4.7.3 0; extends to an automorphism, which we 
still denote o;, of Eo, oj: Eg — Ep. Now E is a splitting field of 
p(x) € Fla], so is a splitting field of p(x) € Ep/z], so by Lemma 4.7.3 
again o; extends to an automorphism, which we again still denote 
by o;, of E. Thus o;: E > E with o;|r = id, so o; € Gal(E/F) and 
oi(a,) = a;. Now by definition D = Fix(Gal(E/F)), and ma(x) € 
[x], 80 o;(Mo(x)) = M(x). But then (recalling that a = a) 


Moi) = Ma(7i(a1)) = Fi(Ma)(Fi(a1)) = Fi(Ma(a1)) = a;(0) = 0 


as claimed. 


Corollary 4.9.13. Let E be a finite Galois extension of F. If B is 
any field intermediate between E and F, then E is a Galois extension 


of 


Proof. By Theorem 4.9.12, E is a splitting field of a separable 
polynomial p(x) € Fiz]. But F C B, so p(x) € B[z]. Thus E is a 
splitting field of the separable polynomial p(x) € B[z], so, again by 
Theorem 4.9.12, E is a Galois extension of B. 


Now we return to the question of separability. 


Definition 4.9.14. Let f(x) € F[z] bea polynomial, f(x) = a,x" + 
An_1x"-1+--++a,2+<aQ9. Its formal derivative f'(x) = Df (x) is the 
polynomial f(x) = naynz”—! + (n — Lan_i2™-? +--+ +44. ?) 


As you can see, the formal derivative is the same as the derivative 
you learned in calculus. But we are calling it the formal derivative 
as we are considering this purely algebraically — we have no notion 
of limit here as we do in calculus. 

You should also be familiar with the following result from calculus. 
We have to prove it separately here, as we can’t use limit arguments. 
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But the purely algebraic proof is in fact much simpler, since we don’t 
have to worry about limits. 


Lemma 4.9.15. (a) The formal differentiation operator D: F|x] > 
Fla] ts a linear transformation, i.e., D(af(x) + bg(x)) = aDf (x) + 
bDg(x) for any a,b € F and any f(x), g(x) € Fla]. 
(b) (Leibniz rule) For any f(x), g(x) € Fla], 
D(f(a)g(a)) = Df (a)g(a) + f(x) Dg(a). 
Proof. 


(a) Is easy to verify. 
(b) Given (a), we need only check this when f(x) and g(x) each 
consist of a single term, say f(x) = 2’, g(x) = a’. But then 


D(f (x)g(x)) = D(a'*?) =(i + j)iti-} 


Df(x)g(z) + f(x)Dg() = (Da')a! + 2°(De’) 
= (ia") (x!) + (2')(ja?~") 


Lemma 4.9.16. Let f(x) € F[z] be an irreducible polynomial. Then 
f(x) is a separable polynomial if and only if f(x) and f'(x) are rel- 
atively prime in F{z]. 


Proof. Let g(x) and h(x) be any two polynomials in Fz] and let 
E be any extension field of F. Let d(x) be the gcd of g(x) and h(x) as 
polynomials in F[{z] and d(x) be ged of g(x) and h(a) as polynomials 
in E[z]. Recall we showed in Lemma 4.2.17 that d(x) = d(x). In 
particular, g(x) and h(x) are relatively prime in F(z] if and only if 
they are relatively prime in E[z]. We apply this here. 

Let E be a splitting field of f(x). 

First suppose that f(x) is separable. Then f(x) = (w—ay)...(x— 
Qn) in E[z], with a1,...,Qp, distinct. Then f’(x) = (x — ag)... (a — 
On) + (@ — a1) (4% — ag) ...(@ — On) +--+ + (w@— 1)... (& — n_1). 
Any nontrivial factor of f(#) must be divisible by « — a; for some 
i, but we see that f’(x) is not divisible by any x — a;. Hence f(z) 
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and f’(z) have no nontrivial common factors, i.e., f(a) and f’(a) are 
relatively prime. 

Now suppose that f(x) is not separable. Then f(x) has at least 
one repeated root a in E[z], so that f(x) is divisible by (2 —a)* with 
e > 2. Write f(x) = (x—-a)°g(x). Then f’(x) = e(x—a)*"19(x)+(2— 
a)©g' (a) and we see that f(x) and f’(a) have the nontrivial common 


factor (x — a)®1, i.e., f(x) and f’(x) are not relatively prime. 


Corollary 4.9.17. (a) Let F be a field of characteristic 0. Then every 
polynomial f(x) € F[x] is separable. 

(b) Let F be a field of characteristic p. Then every irreducible 
polynomial f(x) = O%_,a;x' with at least one term a;x' with a; #0 
with i not divisible by p is separable. 


Proof. In case (a), we may assume f(z) is irreducible. 

In both of these cases, f’(x) is a nonzero polynomial of lower 
degree than f(x). Since f(x) is irreducible, f(x) is relatively prime 
to f’(x). 


Corollary 4.9.18. Let F be a field of characteristic p. If the Frobe- 
nius ®: F + F is onto, then every polynomial f(x) € Fla] is sep- 
arable. In particular, if F is a finite field, then every polynomial 
f(x) € F[z] ts separable. 


Proof. It suffices to consider f(a) irreducible. Then by Corollary 
4.9.18, the only possible nonseparable polynomials are of the form 
f(x) = Ui 9apjz). Now suppose ®: F — F is onto. Then for every 
j, we have ap; = ®(bp;) = bf, for some element bp; of F. Thus, 


p 
m m m 
= ify i P pj — ro) 
f(z) = S high ys = 5 oe = y dp gat 
j=0 j=0 j=0 


is not irreducible. 


We are led to the following definition. 


Definition 4.9.19. A field F is perfect if every polynomial f(x) € 
Fz], or, equivalently, every algebraic extension E of F, is separable. 


We can now conclude: 


Lemma 4.9.20. Every field of characteristic 0, and every finite field, 
is perfect. 
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Proof. This is immediate from Corollary 4.9.17 and 
Corollary 4.9.18. 


We close with an example of a nonseparable extension. 


Example 4.9.21. Let F = F,(t), the field of rational functions in 
the variable ¢ over F,. Let f(x) = x? — t € Fla]. By Abel’s theorem 
(Lemma 4.6.9) f(x) is irreducible. Let E = F(s) be the field obtained 
by adjoining a root s of f(x) to F. Then s? = t, and we see 2? —t = 
xP — sP = (x — s)P is not separable. .) 


4.10 The fundamental theorem of Galois theory 


In this section, we arrive at the fundamental theorem of Galois the- 
ory, one of the great theorems of mathematics. 

Before we get there we have the following result, which is very 
important and useful in its own right. 


Theorem 4.10.1. Let p(x) € Fix] be a separable polynomial and let 
E be a splitting field of p(x). Then 


|Gal(E/F)| = (E/F). 


More generally, if D is any field intermediate between E and F, 
then 


| Gal(E/D)| = (E/D). 


Proof. We prove the general case. The particular case is just the 
special case of the general case when D = F. 

Let Gp = Gal(E/D) and e = (E/D). We prove the theorem by 
complete induction on e. 

If e = 1, then E = D and Gal(E/D) = {id} has order 1. 

Now suppose the theorem is true for all e’ < e, and all fields. Let 
EK be an extension of F with e = (E/D) > 1. Since E is a splitting field 
of p(x) € Fla], we have that E = F(ay,...,Q@,) where a1,...,Q, are 
the roots of p(x) in E. Then also E = D(aj,...,a,,). We may assume 
that a1,...,Q@, ¢ D as otherwise e = 1. 

Consider B = D(a). There are two cases: 

Case 1: E = B. In this case the element a, has degree e over D, so 
its minimum polynomial m,, (x) € D[z] has degree e. Now mq, (x) 
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is irreducible in D[z], and p(x) has aq as a root, sO Mq,(x) divides 
p(x) in D[a]. We are assuming that p(x) is separable, so 7%, (x) is 
separable and hence 7, (x) has e distinct roots a1,...,@e in E. Now 
by Lemma 4.7.1 any 0 € Gp must have o(a;) = a; for some i, and 
by Lemma 4.7.2 there exists 0; € Gp with o(a,) = a; for each i. 
Furthermore, since E = D(a;), this element o; is unique. Thus we 
see Gal(E/D) = {01,...,0¢} so |Gal(E/D)| = e = (E/D). 
Case 2: E > B. Let (B/D) = 6 and let Gg = Gal(E/B). Then e = 
(E/D) = (E/B)(B/D) so (E/B) = e/b. Then E is a splitting field of 
p(x) € Bz], so is a Galois extension of B by Theorem 4.9.12. Thus 
Fix(Gal(E/B)) = B, so by the inductive hypothesis we have that 
IGa| = (E/B). 

By the same logic as in case 1, Mq,(x) € D[z] is an irreducible 
polynomial of degree b with distinct roots ay,...,a@» in B. Let 


H, ={o € Gp | o(a1) =ai}, i=1,...,0. 
Then, by Lemma 4.7.1, Gp = LI, H;. Now 
Hy = {0 € Gp | o(ay) = ay} 
= {o € Gp | olp = id} = Gg. 


We claim that H; is a left coset of H, for each i > 1. To see 
this, let 0; € Gr be any element with o(a,) = a;,1=1,...,b. Such 
elements o1,...,0p exist by Lemma 4.7.2. We show that H; = 0;f, 
= Tsuda 

First, suppose o € o;H;. Then o = o;7 for some 7 € Hy. But then 


8 


o(a1) = oin(a1) = o;(n(a1)) = oi(01) = a 


so o € H;. Thus o;H, C Hj. 
Next, let o € H;, so that o(a1) = a;. Now o;(a1) = a; so 


o-to(a1) = oF (o(a1)) = oF (aa) = 1 


so oo = © H;. But then o = o;n € o;H,, so H; C oj. 
Thus H; = 0o;H,, and so Hy = Gg is a subgroup of Gp of index 
b. But then 


|Gp| = [Gp: Ga]|Ga| = b|Gp| = b(e/b) = e = (E/D) 


as claimed. 
Then by induction we are done. 
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We observe a very important consequence. 


Corollary 4.10.2. Let E be a finite Galois extension of F. Then 


|Gal(E/F)| = (E/F). 


Furthermore, for any field B intermediate between E and F, 


|(Gal(E/B))| = (E/B). 


Proof. This follows immediately from Theorem 4.10.1, Theorem 
4.9.12, and Corollary 4.9.13. 


Theorem 4.10.3 (Fundamental theorem of Galois theory (FTGT)). 
Let E be a finite Galois extension of F and let G = Gal(E/F). 


(a) There is a 1—1 correspondence between intermediate fields E D 
> F and subgroups {1} C Gg C G given by 


= Fix(Gp) > GR = Gal( E/B). 


(b) B is a normal extension of F if and only if Gg is a normal sub- 
group of G. This is the case if and only if B is a Galois extension 
of F. In this case 


Gal(B/F) © G/Gp. 


(c) For each E> BOF, 


(B/F) = |G: Gg] and (E/B) = |Ggl. 
Before proving the FTGT, we will make several observations. 


Remark 4.10.4. 


(a) Note that Gg = {1} and Gg = G. Also, for intermediate 
fields/subgroups of the Galois group, 


1 C Bo => Gp, Cc GR. 


(b) Recall from Corollary 4.9.13 that E is a Galois extension of B 
for every intermediate field B. 
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(c) Since B is a subfield of E and E is a separable extension of F, 
then B is certainly a separable extension of F. (If every element 
of E is separable, then certainly every element of B is separable.) 
This justifies the claim in (b) that B is a normal extension of 
if and only if B is a Galois extension of F. 

(d) As the proof of the FTGT shows, if B is a Galois extension of 
the quotient map 7: G — Gal(B/F) is given by restriction, i-e., 
1T(o) =olp. 


Proof. 


(a) For each subgroup H of G, let 
=F A). 


This gives a mapping 


T: {subgroups of G} — {fields intermediate between E and F}. 


We show I is a 1—1 correspondence. 
Tis 1—1: Let Hy and Hz be subgroups of G. Suppose that By, = 
H»- Let H be the subgroup of G generated by Hj and Hz. Then 
H = By, = By,. Then, by Theorem 4.10.1, 
|H| = |Gal(E/By )| = |Gal(E/Buy, )| = || 
= |Gal(E/By,)| = |H2| 
But H, C H and Hy C H, so we must have H, = H = Hy. 
T is onto: Let B be an intermediate field between E and F. Let 
H ={o0€G| ol|p = id} = Gal(E/B) C Gal(E/F). 

As we have observed, E is a Galois extension of B, so = 

Fix(Gal(E/B)) = Fix(#). 
(c) We have seen in Theorem 4.10.1 that 


(E/F) =|Ge| and (E/B) = |Gp| 


Now 

(E/F) = (E/B)(B/F) 
and 

|Gr| = |Ga|[Gr: Ga] 
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(b) Suppose that Gg is a normal subgroup of G. For any o € G, we 
have o(B) = Fix(oGgo~') = Fix(Gg) = B. Hence, we have a 


restriction map R: Gal( 


H/F) + Gal( 


/F) given by R(o) = ols, 


and so we have an isomorphism 7: G/Ker(R) > Im(R). 


Now Ker(R) = {a € Gal( 


Also, Im(R) = Gal(B/ 
00: —> 
is a Galois extension of 


polynomial p(x) € Fix 
isomorphism oc: 


Conversely, suppose that 


E/F) | olp = id} = Gal(E/B). 
) as follows: Let o9 € Gal(B/F), i-e., 
is an isomorphism which is the identity on F. Now E 
, hence a splitting field of a separable 
, so by Lemma 4.7.3 o9 extends to an 
HE > E. Thus o € Gal(E/F) with R(o) = oo. 

is a Galois extension of F. Then 


is a splitting field of a separable polynomial p(x) € F[z], and 


so B = F(j,... 


o(B) = B. Then 


Fix(Gg) = 


» Br) where Ai, oo 
. Now if o is any element of Gal( 


and so a permutes (4,... 


., 6, are the roots of p(x) in 


E/IF), then o(p(x)) = p(a), 


, 5, and so o leaves 


B= o( 


invariant, 1.e., 


B) = Fix(oGgo') 


so by (a), Gg = cGgo~!. Since o was an arbitrary element of 


Gal(E/ 
Gal(E/F). 


), we conclude that Gal( 


/F) is a normal subgroup of 


We can also make the following observation about the relationship 
between fixed fields and subgroups of the Galois group. 


Lemma 4.10.5. Let E be a finite Galois extension of F and let G = 
Gal(E/F). Let H and K be subgroups of G and let B = Fix(H) and 


= Pic(K). Then 


(a) The composite 
(b) BN 
and K. 


Proof. 


fixes D, and hence a fixes 


o fixes B, so o € H, and o fixes 


= Fix(H 1K); and 
= Fiz(L), where L is the subgroup of G generated by H 


Ifo € HK then o € H, soo fixes B, ando € K, soa 
. On the other hand, if o fixes B 
,soo0 € K,and henceo €C HN K. 


, then 


Also, if o(€) = € for every o € L, then o(e) = € for every o € H, 


so € € B, and o(e) = € for every 0 € K, soe € 


On the other hand, if € € 


ND, then € € 


, and hence € € BND. 
, so o(€) = € for every 
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o € H, and « € D, so o(e) = € for every o € K, so o(e) = € for every 
o EL. 


Our viewpoint here has been to start with a field F and go to 
an extension field E. But we could look at things from the reverse 
viewpoint: Start with a field E and go to a subfield F. From this 
reverse viewpoint we see the following result. 


Corollary 4.10.6. Let E be a field and let G be a finite group of 
automorphisms of E. Let F = Fia(G) be the subfield of E fired by G. 
Then, 


(a) E is a Galois extension of F; 
(b) G= Gal(E/F); and 
(c) (Artin) (E/F) = |G|. 


Proof. 


(a) Let G be the group of all automorphisms of E that fix F. By 
definition, G = Gal(E/F). Then G is a subgroup of G. We claim 


G =G. To see this, note that Fix(G) C Fix(G). But then 


F C Fix(G) C Fix(G) =F. 


Hence, F = Fix(G) and so, by the definition of a Galois extension, 

E is a Galois extension of F. 7 - 

(b) We have that G is a subgroup of G with F = Fix(G) = Fix(G), 
ie., G and G have the same fixed field. But then, by part (a) of 
the fundamental theorem of Galois theory, G = G. 

(c) This is now immediate from (b) and part (c) of the fundamental 
theorem of Galois theory, or, alternatively, from (b), Theorem 
4.9.12 and Theorem 4.10.1. 


Remark 4.10.7. There is a well-known and (justly) well-regarded 
proof by Artin of the fundamental theorem of Galois theory in which 
he uses the equality in part (c) of the above corollary as an essential 
step in the proof. In our proof we have instead used Theorem 4.10.1, 
so in our approach this equality is a consequence of the fundamental 
theorem of Galois theory. © 


Corollary 4.10.8. Let E be a finite extension of F. Then 
|Gal(E/F)| < (E/F), with equality if and only if E is a Galois exten- 
sion of 
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Proof. Let B = Fix(Gal(E/F)). Then F C B, Gal(E/F) = 
Gal(E/B), and E is a Galois extension of B. But then by Theorem 
4.10.1, 


|Gal(E/F)| = |Gal(E/B)| = (E/B) < (E/F) 


with equality if and only if B = F, ie., if and only if E is a Galois 
extension of F. 


Let us further investigate the structure of Galois groups. 


Theorem 4.10.9. Let f(x) € Fiz] be a separable polynomial of 
degree n, and let E be a splitting field of f(x). Then the Galois group 
G = Gal(E/F) is isomorphic to a subgroup of Sy, the symmetric 
group on n elements. If f(x) is irreducible, then G is isomorphic to 
a transitive subgroup of S;,. If f(x) is a product of irreducible polyno- 
mials f(x) = fi(x)... fe(x) with deg fi(x) = n;, then G is isomorphic 
to a subgroup of Sp, X--- X Sn,.- 


Proof. Let aj,...,@p be the roots of f(x) in E, so that E = 
F(aj,...,Qn,). We regard S,, as operating on the set {a1,...,Qn}. 
Then we have a homomorphism y: G — S;,, defined as follows: If 
g € G then o = ¢y(g) is the permutation defined by o(a;) = g(a;). 
Furthermore, ¢ is 1—1 as if g(a;) = a; for every i = 1,...,n, then 
g: E > E is the identity automorphism. 

Also, we have already seen that if f(x) is irreducible, there is an 
automorphism g of E with g(a;) = aj; for any i,j, and so G acts 
transitively on {a1,...,@,} in this case. In any case, G preserves the 
set of roots of each of the individual irreducible factors of f(x). 


Corollary 4.10.10. Let f(x) € Flax] be a separable polynomial of 
degree n, and let E be a splitting field of f(x). If f(x) is irreducible, 
then (E/F) is divisible by n and (E/F) divides n!. If f(x) is a product 
of irreducible polynomials f(x) = fi(x)... fe(x) with deg fi(x) = ni, 
then (E/F) divides ny!...nx!. 


Proof. This is immediate from Theorem 4.10.1 and Corollary 
4.10.6. 


Remark 4.10.11. Note that this is a strengthening of Corollary 
a5 13, 0 
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Even before stating the fundamental theorem of Galois theory, we 
were able to find many Galois groups in Section 4.8. We now revisit 
some of those examples. But before we look at specific examples, let 
us look at Remark 4.8.11. We can now see that the reason for the 
equality there is that every example in that section was an example 
of a Galois extension, by Theorem 4.9.12, and then the equality in 
that remark is due to Theorem 4.10.1. 


Example 4.10.12. Let E = Q(\/az,,/a2) where neither aj, ao, 
nor a,a2 are perfect squares. As we saw in Example 4.8.3, in this 
case Gal(E/Q) = {000, 001, 710, 011} is isomorphic to Zz 6 Zz, where 
aij (ai) = (—1)' /a@ and oj; (\/a2) = (—1)) faz. Then we see that 
G has three subgroups of order 2, Hy = {000,001}, H2 = {000,710}, 
and H3 = {090,011} with fixed fields Fix(H,) = Q(,/az), Fix(H2) = 
Q(,./az), and Fix(H3) = Q(\/aja2). Since G' is abelian, each of these 
subgroups is normal, and so each of these intermediate fields is a 
Galois extension of Q. Furthermore, we see that these three fields 
are all of the fields strictly intermediate between E and Q. © 


Example 4.10.13. Let p be a prime and let E = Q(¢,) as in Exam- 
ple 4.8.4. Then, as we have seen, E is an extension of Q of degree 
p—1 and G = Gal(E/Q) is isomorphic to Z>, a cyclic group of 
order p—1. A generator r of Z} is known as a primitive root (mod 
p), so that r?-! = 1 (mod p) but r?’ #4 1 (mod p) for any i with 
0 <i< p-—1. Then G is generated by the automorphism o, of E 
given by o,(p) = G- Set o = oy. 

Now E is a splitting field of the pth cyclotomic polynomial ®,(zx) 


whose roots are eee cae Then (iy Cpaseagte  y form a vec- 
tor space basis for E over Q, and 1+ G,+-:-+ a = 0. Thus 
(a ph is also a vector space basis for E over Q. 


By the fundamental theorem of Galois theory, the fields interme- 
diate between E and Q are in 1—1 correspondence with subgroups 
Hof G. Now G is a cyclic group of order p — 1, so has a unique 
subgroup Hg of order d for every d dividing p — 1, and these are 
all the subgroups of G. Let s = (p — 1)/d. Then Hg is generated 
by o°. 

Then the intermediate fields between E and Q are the fields Eg = 
Fix(Hq) for every d dividing p — 1. 


328 An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


Noting that G permutes the elements of the basis {¢p,..., a 


E over Q, we see that 


ar) 


oO 


d 


d 
Eq =Q(Oa) where 64¢= 5 (0°)'(G) = > 0*(G) 
11 


4 i=1 
= Ss" ee, 
i=1 


We note that (E/Eg) = d and (Eg/Q) = s, and we observe that 
Kq = Gal(Eg/Q) = Gal(E/Q)/Gal(E/Eg) is the cyclic group of order 
s generated by a, the restriction of o to Eg. 

Note in Example 4.6.8 we performed a special case of this 
construction. In our notation here, ¢; = ¢5 and 0 = @ 9, and 


(Q(65)/Q(A)) = 2, (Q(A)/Q) = 2. 0 


Example 4.10.14. We return to Example 4.8.5 and adopt the nota- 
tion there. We have that G = Gal(E/Q) is a group of order 4, 
isomorphic to Zz ® Zy with G = {00,01,02,03}, where oo(a) = 
a,o1(a) = —a,02(a) = 1/a, and o3(a) = —1/a. Thus G has three 
subgroups Hy = {00,01}, H2 = {00,02}, H3 = {00,03} of order 2, 
with associated subfields Fix(H,) = Q(a), Fix(H2) = Q(a+a~4), 
and Fix(H3) = Q(a — a"). Each of these is a quadratic extension 
of Q, and these are all the fields strictly intermediate between E 


and Q. v) 


Example 4.10.15. We return to Example 4.8.6 and adopt the nota- 
tion there. We have that G = Gal(E/Q) is a group of order 6, 
isomorphic to the symmetric group 53, operating as the group of 
permutations of {a, 3,7}, the roots of the irreducible polynomial 
p(x). From our knowledge of the structure of S3, we see that G 
has three subgroups of order 2 (= index 3), and the fixed fields 
of these subgroups are F(a), F(G), F(7). We observe that these sub- 
groups are not normal subgroups of G, so that these fields are not 
Galois extensions of Q. We also know that G has a normal subgroup 
of order 3 (= index 2), and so the fixed field of this subgroup is 
QA) for some A € E, a quadratic, and Galois, extension of Q. We 
can choose A = (a—(3)(8—)(a—7). To see this, note that certainly 
A € E, but if o € G is an odd permutation, then o(A) = —A so 
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A ¢ Fix(G) = Q. On the other hand A? is fixed by every element of 
G so A? € Q. (We will elaborate on this below.) ?) 


Example 4.10.16. Let us return to Example 4.8.7. To recapitulate, 
we considered there f(x) = 2? —n € QJ], for p an odd prime and 
n € Q not a pth power. Then f(z) is irreducible, and we have E = 
Q(G, /n), a splitting field of f(x), with (E/Q) = p(p—1). We found 
there that G = Gal(E/Q) is a nonabelian group; more precisely 


G=@7 |e =19' S=146r* =e") 


where o and 7 are the automorphisms of E given by o(?/n) = 
Co(W/n), o(Gp) = Gp and 7(Y/n) = Yn, T(G) = oe r a primitive root 
(mod p). Then G = N x H is the semidirect product of the normal 
subgroup N, generated by o, of order p, and the subgroup H, gen- 
erated by 7, of order p — 1. As we observed in Example 4.10.15, H 
is cyclic. 

Let us find all intermediate fields between E and Q. Again, we do 
so by finding all subgroups of G. 

Let g € G. Since G = N x H, we see g can be written uniquely 
as g = o'7I, 0 <i<p—1,0<j < p—2. It is easy to check from 
the structure of G that if i 4 0, the order of g is divisible by p. 

We also know from the Sylow theorems that G has a unique p- 
Sylow subgroup (and that subgroup is therefore normal). We see that 
that subgroup is N, and hence every element of order p must belong 
to N. 

With these observations in hand, let us consider a subgroup F' of 
G. There are two possibilities: 

Case 1: The order of F' is not divisible by p. In this case we 
must have F’ C H. But now we proceed as in Example 4.10.14. The 
subgroup F’ must be one of the subgroups Hg as in that example. 
Thus (following the notation there) we obtain the intermediate fields 


d= Fix( Hq) — Q(6a, Yn) with (E/ d) =d 
and (Ba/Q) = p(p — 1)/d. 
Case 2: The order of F is divisible by p. In this case we must have 


N C F. Now the subgroups of G containing the normal subgroup 
N are in 1—1 correspondence with the subgroups of G/N = H. 
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Hence such a subgroup must be NH, for some d. Thus we obtain 
the intermediate fields 


aq = Fix(N Hg) = Q(6a) with (E/Dq) = pd 
and (Da/Q) = (p — 1)/d. 


Since H is abelian, every such subgroup N#Hg is normal in G, 
which agrees with the fact that every such field Dg is a Galois exten- 
sion of Q. © 


4.11 More on Galois groups 


Suppose that E is a Galois extension of F and consider the Galois 
group G = Gal(E/F). We have already seen a number of examples in 
which we were able to completely determine G. In complete general- 
ity, this is a subtle and difficult problem. But there are some things 
we can say with only a moderate amount of effort, and we will say 
them here. 

The first basic theme of this section is that if extensions have a 
particular structure, we ought to be able to say something particular 
about their Galois groups. 

The second basic theme of this section is in a way the opposite, 
that if extensions have no particular structure, their Galois groups 
should be as general as possible. The path to our goal here will take us 
through symmetric functions, which are themselves very important. 

For the next few results we assume that all fields are contained in 
some large field A, so that it makes sense to talk about composition 
of field extensions. 

Recall Definition 4.3.9.: Two extensions B and D of F are disjoint 
if BOD=F. 


Theorem 4.11.1. Let B and D be finite extensions of F. Suppose 
that B is a Galois extension of F. Then is a Galois extension 
of D, and Gal(BD/D) is isomorphic to Gal(B/BND), a subgroup of 
Gal(B/F), with the isomorphism being given by restriction, ¢ > o|B. 

In particular, if in this situation B and D are disjoint extensions of 
F, then Gal(BD/D) is isomorphic to Gal(B/F) with the isomorphism 
being given by restriction. 
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Proof. Since B is a Galois extension of F, it is a splitting field 
of a separable polynomial f(x) € F[z], so B = F(ay,...,a,) where 
Q1,-.-,Qny are the roots of f(x) in A. But then = ID Gij, «ss e) 
SO is a splitting field of f(a) € D[a], and then is a Galois 
extension of 

Let o € Gal( /D). Then o fixes f(a) (as f(a) has coefficients in 


), so o permutes the roots {aj,.. 
. Thus the restriction of o to 


Let R: Gal( 


/D) > Gal(B/ 


a 


is an element of Gal(B/ 
) be the restriction map. “We 


.,Qn} of f(x), and hence o( 


‘a 


) 


want to show that R is an isomorphism. To do so, we must show that 
R is 1-1 and onto. 


R is 1-1: Suppose that ¢ = R(c) is the identity on 


. Then in 


particular o(a;) = a; for each 7, and then o(a;) = a; for each i. Also, 


a is the identity on 
identity on 


B 


isomorphism ao: 


Corollary 4.11.2. Let 
Suppose that 


, 1 


is a splitting 


/D). 
). Then oa: 
a 


BM 


field of f(a) € 


B 


+B 


( 


/D) = ( 


Proof. 


/F), 


and 


as it is an element of Gal( 
.€., o = id in Gal( 
R is onto: Let o € Gal(B/ 
phism, extending the identity on 
f(x), as f(x) has coefficients in F C BN 


/ 


B 


be disjoint finite extensions of I 
is a Galois extension of F. Then 


(BD/B) = (D/F), and ( 


i) = 


The first equality follows directly from Theorem 4.11.1 and 


(B/F)( 


Theorem 4.10.1. Then the others follow from Corollary 4.3.12. 


), so o is the 


B is an isomor- 
, so in particular a(f(z)) 
. As we have observed, 
[z], so by Lemma 4.7.3 there is an 
extending a, and so R(a) = 


/F). 


Example 4.11.3. Here is a simple example where we do not have 


equality. Let 


B= 


Q(x/2) and 


is an extension of Q of degree 6. 


Theorem 4.11.4. 


(a) 


Let B and 
extension of 
E is a Galois 


F, and let 


be disjoint finite extensions of 


E be the composite 


extension of F. Let G = Gal( 


H = Gal(E/ 


Ve 


= Q(63 7/2), disjoint extensions of Q, 
both of degree 3, neither of which is Galois. Then 


BD = Q(z, V2) 

) 

‘, with B a Galois 

E = BD. Suppose that 
E/P F), N = Gal(E/B), 


Then N is a normal subgroup of G and G is 


the semidirect product G = N = H. Also, H is isomorphic to 


Gal(B/F). 


332 


(b) 


An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


Let B and D be disjoint finite Galois extensions of F, and let E 
be the composite E = BD. Then E is a Galois extension of F. 
Let G = Gal(E/F), Ni = Gal(E/B), No = Gal(E/D). Then Ny, 
and Nog are normal subgroups of G, and G is the direct product 
G = N x No. Also, Ng is isomorphic to Gal(B/F) and Ny is 
isomorphic to Gal(D/F). 

Let E be a Galois extension of F and let G = Gal(E/F). Sup- 
pose that G = N x H is the semidirect product of a normal 
subgroup N and a subgroup H (resp. that G = N, x No is the 
direct product of normal subgroups N, and No). If B = Fix(N) 
and D = Fix(H) (resp. B = Fix(N1) and D = Fix(No)) then B is 
a Galois extension of F (resp. B and D are Galois extensions of 
F) and E= 


Proof. 


(a) 


(c) 


N and H are both subgroups of G. Since B is a Galois extension 
of F we have, from the fundamental theorem of Galois theory, 
that N is a normal subgroup of G and that Gal(B/F) is the quo- 
tient Hp = G/N. But we have an isomorphism from H to Ho 
given by restriction, by Theorem 4.11.1, and this is the same as 
the quotient map, by Remark 4.10.4(d). Hence G is the semidi- 
rect product G= N x H. 

First we must show that E is a Galois extension of F. Now Ny 
is a subgroup of G, so Fix(G) C Fix(N,) = B. Also, No is a 
subgroup of G, so Fix(G) C Fix(N2) = D. Then 


F C Fix(G) CBND=F 


so Fix(G) = F and E is a Galois extension of F. Now by part 
(a), G is the semidirect product G = N, = N2 with both N; and 
No» normal, so G is the direct product G = N, x No. 

This follows directly from the fundamental theorem of Galois 
theory and Lemma 4.10.5. 


Example 4.11.5. We have already seen an example of Theorem 
4.11.4 (a) in Example 4.8.7. 0) 


Example 4.11.6. We have already seen an example of Theorem 
4.11.4 (b) in Example 4.8.2. 0) 
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We now turn our attention to symmetric functions. We fix an 
arbitrary field Fo and let 


a Fo(21, s58 Bayle 


the field of rational functions in n commuting variables x1,...,2pn. 
Then the symmetric group S,, acts on E by permuting the variables. 


Definition 4.11.7. Let F = Fix(S,,). Then F is the field of symmet- 
ric functions in the n variables 71,...,2%, over Fo. © 


Lemma 4.11.8. Let H be a subgroup of S, and let B = Fix(H). 
Then E is a Galois extension of B with Gal(E/B) = H. In particular, 
E is a Galois extension of F with Gal(E/F) = Sy. 


Proof. This is just a special case of Corollary 4.9.13. 


Galois theory gives us a more concrete description of F. 


Definition 4.11.9. For J = {71,...,7,} a subset of {1,...,n} let x; 
be the product:27 =, ...%;,: (FE f=¢,er=1) 

For k = 0,...,n, the kth elementary symmetric function s; of 
{x1,...,%n} is defined by 


Sh= > xy where the sum is over all k-element 


subsets I of {1,...,n}. ©) 


Lemma 4.11.10. There is a polynomial identity: 


f(x) =(@—- 21)... (@& — tm) = $0 (-1)¥ spn. 


Proof. Direct computation. 


Theorem 4.11.11. The field F of symmetric functions in n vari- 
ables over Fo is 


F = Fo(so,..- sh) 


the field of rational functions in the elementary symmetric functions 
S0,-++55n- 
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Proof. Let D = Fo(so,..., 5). Certainly D C F. By Lemma 4.11.8, 
(E/F) = |S,| = n!. But we see from Lemma 4.11.10 that E is a 
splitting field of the polynomial f(a), a polynomial of degree n in 
[x], so (E/D) < n! Hence (E/D) = (E/F) = n!, so D =F. 


There is a lot to say about symmetric functions, but this is not 
the place to say them. Instead, we continue with Galois theory. 


Lemma 4.11.12. Let p(x) be a separable polynomial in Fx] and let 
K be a splitting field of p(x). Let G = Gal(E/F). Let {aj,...,an} 
be the distinct roots of p(x) in E, and let yp: G > Si), be given by 
p(a) = the permutation y(o)(a1,...,Qn) = (o(a1),...,0(An)). 


Let 


6= [I —a;) and A=6". 
i<j 
Then Im(py) C An, the alternating group, if and only if A is a 
square in F. 


Proof. It is easy to check that for any permutation p € Sp, 
p(d) = (—1)#™()9, 


Thus if y(G) C Ap, sign(y(o)) = 1 for every o € G, so 6 € 
Fix(G) = F and A is a square in F. 

On the other hand, if y(G) Z An, let oo € G with sign(y(o0)) = 
—1. Then y(a0)(6) = —6 so A = 6? with 6 not in F and so A is not 
a square in F. 


Now the point of this lemma is that A? is a symmetric function 
of {ay,.-.,@n}, so by Theorem 4.11.11 can be expressed in terms of 
the elementary symmetric functions s9,...,5,, and hence in terms 
of the coefficients of p(x). 


Lemma 4.11.13. 


(a) Let p(x) = 27 + ar +b € Fz] be a separable polynomial. Then 
A =a? — 4b. 

(b) Let p(x) = x3 + ax? + br +c € F{z] be a separable polynomial. 
Then A = —4a%c + a*b? + 18abe — 4b° — 27c*. 


Proof. Direct computation. 
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Example 4.11.14. Let E C C be a splitting field of the polynomial 
p(x) = 22 + 7 +1 € Qa] of Example. Then A = —1399, which is 
not a square in Q, so Gal(E/Q) is isomorphic to $3, as we saw there. 
But now we see that the quadratic extension of Q contained in E, 
which we simply described there as Q(6), is in fact Q(/—1399). © 


Example 4.11.15. Let p(x) = «° — 3x2 + 1, an irreducible polyno- 
mial in Q|z]. Then A = 81, which is a square in Q, so Gal(E/Q) is 
isomorphic to A3, which is itself isomorphic to the cyclic group of 
order 3. © 


Now let us return to general theory. 


Corollary 4.11.16. Every finite group is the Galois group Gal(E/F) 
of some finite Galois extension. 


Proof. Every finite group is isomorphic to a subgroup of a sym- 
metric group, by Cayley’s theorem (Theorem 2.8.4.), so this follows 
directly from Lemma 4.11.8. 


Remark 4.11.17. Of course, this shows that every finite group is 
the Galois group of a Galois extension of some field. It is natural to 
ask whether every finite group is the Galois group of some extension 
of Q. The answer to this question is unknown. © 


Suppose now that f(a) € F[z] is an irreducible separable polyno- 
mial of degree n and let E be a splitting field of f(x). Let f(x) have 
roots Q1,...,Q@, in E and consider G = Gal(E/F). We may think of 
constructing G in stages. First, we know that G acts transitively on 
the roots of f(x), so there is an isomorphism from F(a 1) to F(a;) for 
any 7=1,...,n, and there are n possibilities. Now suppose we have 
some such isomorphism, and we wish to extend it to an isomorphism 
from F(a;,a2) to F(a;,a;). We have at most n — 1 ways of doing 
this, and we should expect that “in general” there should be exactly 
n — 1 ways of doing this, that we would have such an isomorphism 
taking a2 to a; for any a; # a;. Of course, this might not be the 
case. In the most extreme situation, we could have E = F(a) and 
then any automorphism of E is determined by its effect on a;. But 
this precisely illustrates our point — in this situation we have some 
relationship between the roots of f(x) (each a; is a polynomial in a1). 
So if we have no relationship between the roots, we will indeed have 
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n—1 ways of doing this. Then we wish to extend this isomorphism to 
one from F(aj, a2, a3) to F(a;,a;,a,) and “in general” there should 
be n — 2 ways of doing this. Thus “in general” we should expect that 
there will be n(n — 1)(n — 2)...1 = n! automorphisms of E fixing 
F, or in other words that G = Gal(E/F) is isomorphic to the full 
symmetric group S,, and so G gives all permutations of the set of 
roots {a1,...,Q@,} of f(x). For example, this was the case in Theo- 
rem 4.11.11. Otherwise said, G will be smaller exactly when there are 
some “hidden relations” between the roots of f(#) which force auto- 
morphisms of E to have certain properties. (For example, if instead 
of beginning with an irreducible polynomial f(x), we could begin 
with a product f,(x) fo(x) of distinct irreducible separable polyno- 
mials fj(x) and f2(x), where f;(a#) has roots {aj,...,aj;} and fo(x) 
has roots {(1,...,(6,} in E; then any automorphism of E would have 
to leave each of the sets {a1,...,a;} and {(1,..., 8,} invariant.) 

While we have just expressed the expectation that “in general” 
if f(a) is an irreducible separable polynomial, G = Gal(E/F) should 
be isomorphic to S,, we are in the not uncommon situation where 
we have a general expectation but it requires some work to find an 
example. 

In fact, it is a theorem that for any n, there is a polynomial 
f(x) € Q|z] with Gal(E/Q) isomorphic to S,, where E C C isa 
splitting field of f(x). We will not prove this in general, but we will 
prove this when n is a prime. 


Lemma 4.11.18. Let p be a prime. Let f(x) € Q|a] be an irreducible 
polynomial with exactly p—2 real roots. Let E C C be a splitting field 
of f(x). Then G = Gal(E/Q) is isomorphic to Sp. 


Proof. Let aj,...,Qp—2 be the real roots of f(x). Then f(x) = 
(x — a1)... (X — Ap_2)q(x) for some quadratic polynomial q(x) with 
real coefficients, and then from the quadratic formula we see that 
q(x) has two complex roots 6, and 8 with B2 = 2, (the complex 
conjugate of 3}). 

Since f(x) is irreducible, G acts transitively on S = {aj,..., 
Qpy—2, 31, 22}. But we also see that complex conjugation leaves E 
invariant, and restricted to E acts as a transposition tT on S. (That 
is, T(a;) = a; for each i= 1,...,p— 2 and 7(61) = Bo, T(B2) = 1). 
Thus G is isomorphic to a transitive subgroup of 5S, that contains 
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a transposition, and by Lemma 2.10.15 any such subgroup must be 
the full symmetric group Sp. 


It remains to show that there is such a polynomial. We do that 
now. It is trivial for p = 2 so we assume p odd. 


Example 4.11.19. Let p be an odd prime. Let fo(x) be the poly- 
nomial 


fo(x) = 2°(a — 2)(a — 4)... (a — 2(p — 2)). 


We observe that fo(a) has roots x = 0 (a double root) and x = 
2,4,...,p—2. Then we see from elementary calculus that fo(x) has a 
local maximum at x = 0, a local minimum at x = a, for some value 
of ay with 0 < a, < 2, a local maximum at x = ag for some value of 
ag with 2 < ag < 4, and alternate local maxima and minima a; with 
{a;} “interleaved” between the roots of fo(x). Then f(x) is strictly 
increasing between + = a, and x = ag, strictly decreasing between 
Z=ag and ££ =asg,.... 

The local maxima of f(x) are at x = ag with 2 < ag <4, 2 = a4 
with 6 < a4 < 8, etc. Whatever the exact values of f(a2), f(a4),.-., 
are, we know that fo(a2) is the maximum value of f(x) on the interval 
[2,4], so in particular fo(a2) > fo(3) > 3?; similarly fo(a4) > fo(7) > 
77, etc. Thus, if we let 


f(x) = fol) — 2 = a°(a — 2)(e — 4)... (e — 2(p — 2)) — 2, 


we see that fo(x) has exactly p — 2 real roots: one between a; and 
aj, one between a2 and a3,..., one between apy—3 and ap_2, and one 
greater than a,)_2. 

Finally, we note that f(a) isa monic polynomial of degree p; every 
coefficient of f(z) other than the coefficient of x? is divisible by 2, 
and the constant term —2 is divisible by 2 but not by 4. Hence, by 
Eisenstein’s criterion, f(x) is irreducible. 0) 


4.12 Simple extensions 


Let E be a finite extension of F. We know that E = F(aj,...,Qn) 
for some elements Q1,...,Q@, of E, all of which are algebraic over F. 
That is, E is obtained from F by adjoining finitely many algebraic 
elements. 
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We have already asked whether any finite extension of F can be 
obtained by adjoining a single element a of E to F, so that E = F(a) 
for some element a of E. In this section we show that is true in very 
great generality (though not all the time). 


Definition 4.12.1. Let E be a finite extension of F. If there is some 
element a of E such that E = F(a), then E is a simple extension of 
F. Such an element a is called a primitive element of E over F. 


We shall abbreviate this to primitive element of E, when F is 
understood. 


Theorem 4.12.2 (Theorem of the primitive element). Let E 
be a finite separable extension of F. Then E has a primitive element 
a, and so E is a simple extension of F. 


Proof. First we consider the case where F is a finite field. Then E 
is a finite dimensional vector space over F, so has only finitely many 
elements, i.e., E is a finite field. But then we know from Corollary 
3.3.11 that E*, the multiplicative group of E, is cyclic. Let a be a 
generator. Then certainly E = F(a). 

Now suppose that F is infinite. Let E = F(aj,...,a,). First we 
claim that E is obtained from F by adjoining roots of a separable 
polynomial f(x) € F[z]. To see this, note that each a; is a root of 
some separable irreducible polynomial f;(x) € Fla]. Then we may 
take f(x) to be the product (or, more neatly, the least common mul- 
tiple) of fi(x),---, fn(2): 

Let us extend E to D, a splitting field of f(x). Since D is the 
splitting field of a separable polynomial, we know that D is a Galois 
extension of F (Theorem 4.9.12). Let G = Gal(D/F). Since E is an 
intermediate field between D and F, by the fundamental theorem of 
Galois theory we know that E = Fix(H) for some subgroup H of G. 

Now let B be any proper subfield of E. Then, again by the fun- 
damental theorem of Galois theory, we know that B = Fix(K) for 
some subgroup K of G that properly contains H. We also observe 
that dimp B = (B/F) < (E/F) = dim, E. 

Now G is a finite group, so has only finitely many subgroups. 


Thus 
LU B= U Fix(k) 


BCE HcCkK 
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I 


is a finite union of proper subspaces of E (regarded as an F-vector 
space). But then, by Theorem B.1, this union cannot be all of E. Let 
a be any element of E not in this union. Then E = F(a), so a is 
primitive and E is simple. 


Remark 4.12.3. A “general” element of E will not be a member of 
any finite union of proper subspaces of E so if we pick a “at random” 
we should expect that E = F(a). ?) 


Before proceeding further, we want to make an important 
observation. 


Remark 4.12.4. We have been careful to talk about a Galois group 
being isomorphic to a subgroup of a symmetric group S,, rather than 
a Galois group being a subgroup of a symmetric group S;,. Let us 
see why it has been necessary for us to be so careful. Consider, for 
example, two distinct irreducible separable polynomials p;(a) and 
p2(x) in Fla] of degrees n; and ng respectively. Set n = nj + neg. Let 
E be a splitting field of the product p1(x)p2(x). Then p(x) has roots 
{a1,...,Qn,} in E and po(x) has roots {(1,...,8n.} in E, and the 
action of the Galois group Gal(E/F) preserves each of these sets, so 
Gal(E/F) is isomorphic to a subgroup of S,, x Sp. C Sp; in particular 
Gal(E/F) is isomorphic to a nontransitive subgroup of S;,. On the 
other hand, from the theorem of the primitive element we know that 
K has a primitive element y. Let d = (E/F), and note that d is the 
degree of y over F. 

Then m,(x), the minimal polynomial of 7, is an irreducible poly- 
nomial of degree d, so has d roots {71,..., Ya} in E, and the action of 
the Galois group permutes these roots transitively, so G is isomorphic 
to a transitive subgroup of Sq. 

Thus, in general, given an extension E of F, the Galois group 
Gal(E/F) can be realized as a permutation group in very different 
ways. © 


Theorem 4.12.2 tells us that, if E is separable, a primitive element 
always exists. It is natural to ask how to find one. We begin with a 
general criterion for an element of E to be primitive. 


Lemma 4.12.5. Let E be a finite Galois extension of F. Then a € E 
is primitive if and only if o(a) 4 a for any o € Gal(E/F), o £ id, 
or, equivalently, if {a(a) | o € Gal(E/F)} are all distinct. 
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Proof. We show the contrapositive. By definition, a is not primi- 
tive if and only if F(a) is a proper subfield of E, and by the funda- 
mental theorem of Galois theory, this will be the case if and only if 
F(q) is fixed by some nontrivial subgroup H of Gal(E/F), which will 
be the case if and only if a is fixed by H. 


Example 4.12.6. Returning to Example 4.8.3, where E = 
Q(./a,..-,,/at), from the description of Gal(E/Q) we gave there 
we see that a = fay +--+ + \/@ is a primitive element of E 
over Q. © 


Here is one situation in which we can readily exhibit a primitive 
element. 


Lemma 4.12.7. Let E be a finite Galois extension of F such that 
G = Gal(E/F) is an abelian group. Let f(x) € F[a] be any irreducible 
as cae E as a splitting field, and let a be any root of f(x) 
in E. Then E = F(a), i.e., a is a primitive element of E. 


Proof. We prove the contrapositive. Let f(a) have roots a1,..., Qn 
in E, so that E = F(aj,...,a@,). Let @ = a, and suppose that E # 
F(a). For each i = 1,...,n, let H; be the subgroup of the Galois 
group that fixes a;, ie., Hj = {0 € G| o(a;) = a}. Then H; = 
Gal(E/F(a;)). 

Since f(z) is irreducible, G acts transitively on {a1,...,a@,} and 
so {H,,...,H,} are mutually conjugate subgroups of G. Further- 
more, these subgroups cannot all be the same, as then we would 
have F(a;) = Fix(H;) = Fix(H1) = F(a1) for each i, in which case 
E = F(ay,...,Qn) = F(a1)---F(an) (the composite) = F(a). 

Thus G contains the subgroup H, which is not normal, and so G 
is not abelian. 


Before proceeding further, it is illuminating to introduce a 
stronger notion. 


Definition 4.12.8. Let E be a finite Galois extension of F with 
Galois group G = Gal(E/F). If 6 € E with S = {o(0) |o € Gh a 
vector space basis for E over F, then S is called a normal basis for E 
over F and @ is a normal element of E over F. © 


Remark 4.12.9. 


(a) Normal basis is standard language but normal element is 
not. 
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(b) It is a theorem, which we will not prove here, that every finite 
Galois extension has a normal basis. 

(c) Since |G| = (E/F) (by Theorem 4.10.1), S is a basis for E if and 
only if S is linearly independent. © 


Lemma 4.12.10. Let E be a finite Galois extension of F. If 0 € E 
is normal, than 0 is primitive. 


Proof. If {o(@) | o € Gal(E/F)} is linearly independent, then cer- 
tainly all elements of this set are distinct, and so @ is primitive by 
Lemma 4.12.5. 


7h 


Example 4.12.11. Let a € Q not be a perfect square, and let E = 
Q(./a). Then \/a is a primitive but not normal element of E, while 
1+ /a is a normal element of E. .) 


Our theme for (almost) the remainder of this section will be to 
see that if we know primitive (or normal) elements of some field 
extensions, we can use them to obtain primitive (or normal) elements 
of other field extensions. 

Again we shall implicitly assume, when necessary, that all our 
fields are subfields of some larger field A, so that composition of 
fields makes sense. 


Remark 4.12.12. We see immediately that if a is a primitive (resp. 
normal) element of E over F and B is any field intermediate between 
EK and F, then a is a primitive (resp. normal) element of E over B. > 


Lemma 4.12.13. Let B and D be disjoint finite extensions of F with 
B a Galois extension of F. Then 6 € B is a primitive (resp. normal) 
element of over D if and only if 8 is a primitive (resp. normal) 
element of B over F. Also, 6 € D is a primitive element of BD over 
B if and only if 6 is a primitive element of D over F. 


Proof. The statement about ( follows directly from Theorem 
4.11.1. As for the statement about 6, let Do = F(0). Since B and 
are disjoint, certainly B and Dp are disjoint. But Do = D if and only 
if (Do/F) = (D/F), and BDp = BD if and only if (BDo/B) = (BD) /B, 
so this follows from Corollary 4.11.2. 


Lemma 4.12.14. Let E be a finite Galois extension of F and let 0 be 
a normal element of E over F. Let B be any field intermediate between 
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KE and F, so that B = Fix(H) for some subgroup H of G = Gal(E/F). 
Set 


ay = a a(0). 


o€H 


(a) IfB is a Galois extension of F (which will be the case if and only 
if H is a normal subgroup of G) then ay is a normal element of 

B over F. 

(b) In any case, ay is a primitive element of B over F. 


Proof. 


(a) Let H = Hj,..., Hy be the left cosets of H in G and let ay, = 
NoeH,o(O) for 7 = 1,...,t. It is straightforward to check that 
{aH,,-.-,@,} is a normal basis of B over F. 

(b) Let Bo be any proper subfield of B. Then Bp = Fix(K’) for some 
subgroup K of G properly containing H. Let p € K, p ¢ H. Then 
p © H; for some i ¥ 1, and then p(ay,) = ag,. In particular, 
p(ay) # ay, and so ag ¢ Bo. Thus we must have F(ay) = 
and so ay is a primitive element of B over F. 


Example 4.12.15. Returning to Example 4.10.13, we began there 
by considering E = Q(g,). Our argument there shows that ¢, is a 
normal element of E. We further showed there that, for each field Eg 
intermediate between E and Q, Eg = Q(6q). In fact, the construction 
of those elements 0g was just a special case of this lemma, and we 
see now that each 6g is not just primitive but also normal. © 


Theorem 4.12.16. Let B and D be disjoint finite extensions of F 
and let E be the composite E = . Suppose that B and E are Galois 
extensions of F. 


(a) Suppose that 8 € B is a primitive element of E over D, or, 
equivalently, that 8 is a primitive element of B over F, and that 
6 €D ts a primitive element of E over B, or, equivalently, that 6 
is a primitive element of D over F. If char(F) 4 0 or if char(F) = 
p and at least one of (E/D) and (E/B) is relatively prime to p, 
or, equivalently, if at least one of (B/F) and (D/F) is relatively 
prime to p, then e = 64+ 6 is a primitive element of E over F. 
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(b) Suppose that 6 € B is a normal element of E over D, or, 
equivalently, that 3 is a normal element of B over F, and that 
6 €D is a normal element of E over B, or, equivalently, in the 
case that D is a Galois extension of F, that 6 is a normal element 
of D over F. Then € = 06 is a normal element of E over F. 


Proof. First note that the “equivalently” in the statement of the 
theorem follow from Theorem 4.11.1, Corollary 4.11.2, or Lemma 
4.12.13. 


(a) We have, by Theorem 4.11.4, that G = Gal(E/F) is the semidi- 
rect product of its normal subgroup N = Gal(E/B) and its sub- 
group H = Gal(E/D). In particular, every element of G can 
be written uniquely as or with o € N and 7 € H, and then 
ot = To! with o =7~or EN. We will apply Lemma 4.12.4 to 
show that « = 6 +6 is a primitive element of E over F. Suppose 
ot(e) =e. Then 


B+6 =a7(8+6) = o7(B)+o7(5) = T0'(B)+07(5) = 7(8)+a(5) 


and so 
B—7(8) =0(6) — 64. 


But, from Theorem 4.11.1, 7(@) € B and o(d) € D. Hence 6 — 
7(8) =0(6) -de BND=F. 
If char(F) = 0 choose p to be either of o and r. If char(F) = p 
then at least one of o and 7 has order k relatively prime to p (as 
|N| = (E/B) and |H| = (E/D)); choose p to be such an element. 
If p = o, choose y = 0, while if p = 7 choose y = §. Then in 
any case we have p(y) = y+ a for some a € F. But then, since 
p* = id, we have y = p*(y) = y+ka so a = 0. But then p = id, 
and then we see that both o = id and 7 = id, soor =id €G 
and by Lemma 4.12.4 € is a primitive element of E over F. 

(b) This follows directly from our construction of a basis of E over 
F in Lemma 4.3.11. (We observed there in general that we had a 
spanning set, but given Corollary 4.11.2, here this spanning set 
is a basis.) 


Example 4.12.17. We return to Example 4.8.3. Beginning with 
Example 4.12.11, and applying Theorem 4.12.16 inductively, we see 
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that ag ay is a primitive element of E over Q (as we did in 
Example 4.12.6) and that IIi_,(1 + ,/@) is a normal element of E 
over Q. 0) 


Example 4.12.18. We consider the field E of Example 4.8.7, E = 


Q(G, Yn) = QA(G)Q( Wn), with these two fields being disjoint exten- 
sions of Q. Then, from Theorem 4.12.16, we see that ¢, + #/n is a 
primitive element of E over Q. 


In Theorem 4.12.16 we obtained primitive elements by adding 
primitive elements, and normal elements by multiplying normal ele- 
ments. Let us see that the roles of addition and multiplication cannot 
be interchanged. 


Example 4.12.19. Let E = Q(v2, V3). Then 2 is a primitive 
element of Q(V2), and V3 is a primitive element of Q(V3), but 
(V/2)(V3) = V6 is not a primitive element of E. 
Also, 1+ V2 is a normal element of Q(V2), and —1+ V3 is a 
normal element of Q(V/3), but (1 + V2) + (-14 V3) = V24 V3 is 
0 


not a normal element of E. 


Theorem 4.12.16 had a number of hypotheses. First, it required 
that the extensions B and D be disjoint. Here is an example to show 
we need this hypothesis. 


Example 4.12.20. Let B = Q(vV2, V3) and D = Q(V3, V5), so that 
R= = Q(v2, V3, /5). Then /2+ V3 isa primitive element of 
over Q, and —V/3+ V5 is a primitive element of D over Q, but 
(/2 + V3) + (-V3 + V5) = /2+ V5 is not a primitive element of 
E over Q. 

Also, (1 + V2)(1 + V3) is a normal element of B over Q, and 
(1— V3)(1+ V5) is a normal element of D over Q, but ((1+ V2)(1+ 
V3))((1- V3) (1+ V/5)) = —2(1+- V2)(1+ V5) is not a normal element 
of E over Q. ‘. 


Next, Theorem 4.12.16 required that at least one of the two inter- 
mediate extensions be Galois. Here is an example to show we need 
this hypothesis. 


Example 4.12.21. Let ¢3 be a primitive cube root of 1 and recall 
that 14+ 6 +2 = 0. Let B = Q(V2) and D = Q(G V2). Then 
. = = Q(72, 6372) = Q(¢3, V2) is a Galois extension of Q of 
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degree 6. Then V2 is a primitive element of B and ¢3 7/2 is a primitive 
element of D, but </ D4 (37/2 = -~GW/2 is not a primitive element of 
E over Q. 0) 


Finally, Theorem 4.12.16 had a condition on the degrees of the 
two intermediate extensions in case char(F) = p. Here is an example 
to show we need this hypothesis. 


Example 4.12.22. Let F = F,,(t,, t2), the field of rational functions 
in two variables tj and tg over F,. Let fi(z) = xv? — x — t; and 
fo(z) = x? — x — te, polynomials in F{z], and let E be a splitting 
field of f(x) fo(x). Then E = F(s1,s2) = F(si)F(s2), where B = 
F(s,) is obtained by adjoining a root of f;() to F and D = F(sg) is 
obtained by adjoining a root of f(x) to F. Thus s} — 5; —t; = 0 and 
a — s9 —ty = 0. But then 


(s) +1)? — (81 +1) -th =sf+1—-(s1+1) -th =s/-—s1-t =0 
1 


and similarly (s2 + 1)? — (s2 + 1) = tg = 0. Then the roots of fi (zx) 
in B (or E) are {s,,5, +1,...,51 + (p — 1)} and the roots of fo(z) 
in D (or E) are {s2,59+1,...,52 -+(p—1)}. We then see that G = 
Gal(E/F) is isomorphic to Z, x Zp, with generators o and 7 acting 
by o(s1) = $1 + 1,0(s2) = 82 and 7(s1) = 51, T(s2) = G +1. Now 8 
is a primitive element of B and s2 is a primitive element of D, but 


ot '(s1 + 82) =o7 *(81) +077 *(s2) = o(81) + 77 *(82) 


= (8s; +1) + (sg -1) = 51+ 82 


so, by Lemma 4.12.5, s; + s2 is not a primitive element of E 
over F. © 


We close this section with an example of an extension that is not 
simple. 


Example 4.12.23. Let F = F,(t,,t2) be the field of rational func- 
tions in two variables t; and tz over F,, as in the last example. Let 
fi(z) = x? — ty and fo(x) = x? —te, polynomials in F[z], and let E be 
a splitting field of fi (x) fo(x). Then E = F(s,)F(s2), where B = F(s1) 
is obtained by adjoining a root of fi(x) to F and D = F(sg) is 
obtained by adjoining a root of f2(x) to F. Note that neither B nor 
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(nor E) is a separable extension of F: x? — t; = (a — 51)? in Bia] and 
a? — ty = (x — 82)” in D[z]. Now E is an extension of F of degree p?, 
but it is easy to check that if a@ is any element of E, then a? € F, 
so F(a) is an extension of F of degree 1 or p, and hence F(a) ¢ E. 
Thus, E does not contain a primitive element over F, i.e., E is not a 
simple extension of F. © 


4.13 Finite fields 


With the help of Galois theory, we can readily determine the struc- 
ture of finite fields. 


Theorem 4.13.1. Let p be a prime. 


(a) For any positive integer n, there is a field Fyn containing p” 
elements. This field is a splitting field of the polynomial f(x) = 
xP" — x €F,[a], and is unique up to isomorphism. 

(b) Let m <n. Ifm does not divide n, then Fyn does not contain a 
field of p” elements. If m divides n, Fyn contains a unique field 
of p™ elements. 

(c) The Galois group Gal(Fpx/F) is a cyclic group of order n, gen- 
erated by the Frobenius automorphism ®: Fyn — Fyn given by 
O(a) =a’. 


Proof. 


(a) Let E be any field in which f(x) splits. Then, by Corollary 4.9.17, 
f(x) has p” distinct roots in E. Let E= {a € E| f(a) = 0}. We 
claim that E is a field. Certainly 0 € E and 1€ E. If aj,a2 € E 
then 


f(ar + a2) = (ay + a)?” _ (a1 + Q2) 


= (a? — a1) + (a8 — a2) =0+0=0 


so a1 + a2 € E. Also, 


nm nm nm 
f(a1a2) = (ara2)? —ayag =a’ ab —ajay = aja2—a a2 = 0 
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so a1Qa2 € E. Also, if a, 4 0, 
f(1/oy) = (1/ay)?” — (1/o1) = 1/a®" —1/ay = 1/o1,—1/a; = 0. 
Thus we see that E is a field. E consists entirely of the roots of 
f(x), so E has exactly p” elements. Clearly f(x) splits in E, and 
cannot split in any proper subfield of E, as such a subfield would 
contain fewer than p” elements. Thus E is a field of order p” that 
is a splitting field of the polynomial f(x). Then E is unique up 
to isomorphism by Lemma 4.7.3. 

(b) Let G = Fi, be the multiplicative group of Fpn. Then G is a 
group of p” — 1 elements, and G is cyclic by Corollary 3.3.11. 
Let m <n. It is easy to check that if m does not divide n, 
p'™ — 1 does not divide p” — 1, and so G cannot have a subgroup 
containing p'™ — 1 elements, and so Fyn cannot have a subfield 
containing p’™ elements. On the other hand, if m divides n, then 
p’™ — 1 divides p” — 1, so G contains a unique subgroup H with 
p™ — 1 elements. Let B = HU {0}. If a € B, a ¥ O, then 
a” — a = a(a?"—! — 1) = a(0) = 0, and certainly 0?" — 0 = 0. 
Thus, by the same argument as in part (a), B is a subfield of Fyn 
containing p’™” elements. 

(c) On the one hand, (Fp)/F,) = dimg,, Fyn = n. On the other hand, 
by Theorem 4.10.1, (Fp»/F) = |Gal(Fpx/F)|. Thus this Galois 
group is a group of order n. But ® € Gal(Fpn/F) and © is an 
element of this group of order n by Lemma 4.8.8. Thus we must 
have that Gal(F,»/F) is the cyclic group of order n generated 
by ®. 


4.14 Cyclotomic fields 


We have already investigated the pth cyclotomic field Q(G,). We now 
investigate the nth cyclotomic field Q(¢,) for an arbitrary positive 
integer n. 


Definition 4.14.1. Let n be a positive integer. Let ¢, = exp(27i/n). 
Then Q(¢,,) is the nth cyclotomic field. 


Definition 4.14.2. Let ¢ € C with ¢” = 1. Then ¢ is a primitive 
nth root of 1 if ¢” # 1 for any positive integer m less than (or, 
equivalently, properly dividing) n. % 
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Definition 4.14.3. The nth cyclotomic polynomial ®,(a) is defined 
by 


(x) = [[(@- 4 € Cia] 
where the product is taken over the primitive nth roots of 1. © 


Lemma 4.14.4. Let n be an arbitrary positive integer. Then 
x” -1=|] a(z). 
d\n 


Proof. The roots of «”—1 are all the complex nth roots of 1. But 
every complex nth root of 1 is a primitive dth root of 1 for some 
unique d dividing n, so grouping them together yields the right-hand 
side. 


Theorem 4.14.5. Let n be an arbitrary positive integer. Then ®,,(x) 
is a polynomial with integer coefficients, i.e., Pn (x) € Zlz]. 


Proof. We prove this by complete induction on n. 

For n= 1, ®\(@) =x-—1€ Za]. 

Now suppose the theorem is true for every d < n. By Lemma 
4.14.4, we have that 


f(v) =a" —1= (I'Gq(2)) ®n(a) = g(x) Gn(2). 


where the product inside the parentheses is taken over the proper 
divisors of n. Clearly g(x) divides f(x) in C[z]. But f(x) € Q/z] and 
by the inductive hypothesis g(x) € Q/z]. But then, by Lemma 4.2.17, 
g(x) divides f(x) in Qa]. Thus, f(z) = g(x)h(x) with h(x) € 
Q|z] so ®,(z) = h(x) € QIz]. But then, by Gauss’s Lemma 
®,(x) € Z[z]. 


Remark 4.14.6. Note that Lemma 4.14.4 gives us an inductive pro- 
cedure for finding ®,(x). Namely, 


®p(2) = (x" — 1)/ (TBq(e)) 
where the product is taken over the proper divisors d of n. © 


Theorem 4.14.7. For every positive integer n, the nth cyclotomic 
polynomial ®,,(x) is irreducible in Z|x] (or, equivalently, in Q|z]). 
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Proof (Dedekind). We prove this theorem by contradiction. 

Suppose that ®,,(«) is not irreducible. Then ®,,(x) has a monic 
irreducible factor f(x) in Q|z] with f(¢,) = 0 but with f(¢) 4 0 
where ¢ is some primitive nth root of 1. Now every primitive nth 
root of 1 is of the form ¢ = ¢* for some k relatively prime to n. 
Choose the smallest such k with f(¢*) 4 0 and let p be any prime 
factor of k. 

Let Co = Ch’? so that f(Go) = 0 but (cb) = f(¢) #0. 

Then f(z) is an irreducible monic polynomial in Q|z] with f(¢o) = 
0, and k(a#) = x” — 1 is a monic polynomial in Q|z] with k(¢o) = 0, 
so f(x) divides k(x) in Q[z], ie., k(x) = f(x)g(x) with f(x), g(x) € 
Q|z]. But then, by Gauss’s Lemma, f(x), g(x) € Z[x]. Now 


0=k(C) = F(C)9(¢) with f(¢) £0, so g(¢) = 0. 


Let g(x) = Dez", so g(¢) = Nei? = Le;(¢)*, so h(Go) = 0 where 
h(x) is the polynomial h(x) = Nex". 

Then f(z) is an irreducible polynomial in Q|z] with f(¢o) = 0, 
and h(2) is a polynomial in Q[z] with h(¢o) = 0, so f(x) divides h(x) 
in Q[z], and by Gauss’s Lemma again, f(x) divides h(x) in Z[z]. 
Observe that h(x) = g(x?). 

Now let a: Z[x] + Z,|r] = F,|z] be the map given by reducing 
coefficients (mod p). Then 


m(f(x)) divides m(h(x)) = m(g(a?)) = (g(a)? in Z,[a]), 


so m(f(a#)) and m(g(x)) have an irreducible factor m(a) in common 
in Z,[x] = F,[z]. 

Now k(x) = f(x)g(x) so r(k(x)) = 7(f(x))m(g(x)) so we see that 
m(k(z)) is divisible by m(zx)? in F,[a]. But then k(x) = as (x)) would 
have a multiple root in a splitting field E. But k(x) = x" —1 with n 
relatively prime to p, so k'(r) = nx"! 4 0 in F,[z], so by the proof 
of Lemma 4.9.16 this cannot be the case. 


Corollary 4.14.8. Let n be a positive integer. Then Q(G,) is an 
extension of Q of degree p(n), where y(n) is the Euler totient 
function. 


Proof. Q(¢,) is obtained from Q by adjoining a root ¢, of the 
cyclotomic polynomial ®,,(2), an irreducible polynomial of degree 


y(n). 
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We now study the relationship between different cyclotomic fields. 


Lemma 4.14.9. Let m and n be positive integers. Set g = gcd(m,n) 


and £ = Icm(m,n). Then Q(Gn)Q(Gn) = Q(Ge) and Q(Gn) 
Q(Gr) = Q(G,). In particular, if m and n are relatively prime then 


Q(Gm) QC) = QGmn) and Q(Gm) A Q(Gn) = Q. 


Proof. Since Gm = (¢)/™ and ¢, = (C¢)"/” we certainly have that 


On the other hand, we know that we can write g = mxo + nyo for 
some integers 70, y9. We also know that mn = gf. Thus, 


1/€ = g/mn = (mxo + nyo)/mn = x0/n + yo/m 
and so 
y= exp(2mi/€) = exp(2mi(t0/n + Yo/m)) 
= exp((2mi/n)xo) exp((2mi/m)yo) 
re ge 


so Q(¢e) © Q(Gm)Q(Gn) and hence they are equal. 

Let E = Q(Gm) 1 Q(Gn) and set e = (E/Q). Since Cy = (Gm)/9 = 
(Gn)"/9 we see that Q(¢,) C E. Now Q(Gm) and Q(¢,) are disjoint 
Galois extensions of E, so by Corollary 4.11.2, 


(Q(6e)/E) = (Q(6m)Q(Gn)/E) 
= (Q(6m)/E)(Q(6n)/E) 
and so 
(Q(¢e)/E) (E/Q)(E/Q) 
= (Q(m)/E)(E/Q)(Q(¢n)/E)(E/Q), 
(Q(e)/Q) (E/Q) = (QGm)/Q)(Q&n)/Q), 


But it follows from our earlier work (see Lemma 3.6.16) that 
p(L)y(g) = y(m)y(n) for any two positive integers m and n, so 
e = p(g) and E = Q(¢y). 
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Finally, we look at the Galois group. 


Theorem 4.14.10. Let n be a positive integer. Then the Galois 
group Gal(Q(Cn)/Q) is isomorphic to Z*, the multiplicative group 
of Zn. 


Proof. The cyclotomic field Q(¢,) is obtained from Q by adjoining 
the element ¢,, a root of the irreducible polynomial ®,,(x), and so 
there is a unique element o of Gal(Q(¢,)/Q) with o(¢,) = ¢* for 
every integer k (mod n) that is relatively prime to n. 


4.15 Solvability and unsolvability of equations 


Greek mathematicians at the time of Euclid knew how to solve 
quadratic equations, and the quadratic formula is very old. There 
matters stood for a long time, until in the 16th century Cardano and 
Tartaglia derived a formula for solving cubic equations, and Ferrari 
derived a formula for solving quartic equations. 

The next case is quintic equations, which mathematicians worked 
on for centuries — indeed, this problem spurred on the development 
of algebra. Then, in 1824, Abel showed that is no quintic formula, 
ie., that the general quintic cannot be solved by radicals. 

In this section we will develop a criterion for an equation to be 
solvable by radicals, and apply it to show that the general equation 
of degree greater than or equal to five cannot be solved by radicals. 

We suppose in this section that all fields we are dealing with have 
characteristic 0. We fix a ground field F. 

First we must make the notion of solvability by radicals precise. 


Definition 4.15.1. An equation f(x) = 0, f(x) € Fz], is solvable by 
radicals if there is a sequence of extensions F = Eg C E, C--- C Ex 
with E, D E, a splitting field of f(x), where for each i = 1,...,k, 
E, is a splitting field of a polynomial f;(x) € E’'(x) of the form 
fi(w) = 2" -e. 


Remark 4.15.2. Note that, for example, if n = pq, then Ye = 
{/¥/e. Thus, we may assume in Definition 4.15.1 that each integer n 
is a prime. v) 


Recall that we defined solvable groups in Definition 2.9.16. 
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Here is our main theorem. 


Theorem 4.15.3. If the equation f(x) = 0 is solvable by radicals 
then the Galois group G = Gal(E/F) is a solvable group. 


Proof. Suppose that f(x) = 0 is solvable by radicals. In Definition 
4.15.1, let the polynomial f;(z) have degree p;, where, by Remark 
4.15.2, we may assume that each p; is prime. 

Since E; is a splitting field of fi(z) € E,_1[z], E; is a Galois 
extension of E;_1, for each 7 = 1,...,k. But E; is not necessarily a 
Galois extension of F. We begin by constructing a new sequence of 
extensions F = Ej C E, C--- C E; with each E} a Galois extension 
of F. We do so PaNEUEY as follows: 

We begin with Ej = Eo = F. Now suppose that E}_, is a Galois 
extension of F. We have, by assumption, that E; is a splitting field 
of a polynomial f;(x) = x?! — e with e € E,_;. Consider instead the 
polynomial 


gi(x) = |] (w” — o(e)) 


oO 


where the product is taken over all o € Gal(E;_;/F). Since g;(z) 
is invariant under the action of Gal(E;_;/F), and E;_, is a Galois 
extension of F, we have that g;(~) € F[z]. Then let Ei = BE, 
where B is a splitting field of g;(x) over F. As both B and E,_ are 
Galois extensions of F, E;, is a Galois extension of F; indeed it is a 
splitting field of the product gi-1(x)gi(x) € Fla]. Note E; C E; for 
each i, and so E C Ej. 
Since E, is a Galois erenston of F, it is ? Galois extension of 
every intermediate field E/. Furthermore each E; is a Galois extension 
of E,_,. Thus, if G = Gal(E//F), H; = Gal(E;,/E/), then by the 
fundamental theorem of Galois theory Gal(E//F) is isomorphic to 
G/H;, and Gal(E}_,/F) is isomorphic to G/H;_1. But then H;_1/H; 
is isomorphic to Gal(E;/E/_,). We now Cat this up 

For simplicity of notation let B = E;_, and D = E;. Let D be 
a splitting field of II{_,x2? — a; where ~ is a prime a one Bui = 
1,...,¢. Then = Ble», Bi,.-- 58s) where 8? = a,j. 

Let o = B(G,) and for 7 = 1,...,t¢ let D; = Dj_1(G;) (so that 


=D). 
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Now Dp = BQ(G,) so Gal(Do/B) is isomorphic to a subgroup of 
Gal(Q(¢p)/Q). This group is cyclic so Gal(Do/B) is cyclic as well. 
For i = 1,...,¢ consider Gal(D;/D;_). There are two possibilities: 

If a; € Dj_; then D; = D;_; and this Galois group is trivial. 

If a; ¢ Dj; then by Abel’s theorem the polynomial x? — a; is 
irreducible in D;_;. On the other hand, it splits as (a — 6;)...(a — 

P18.) in D,[z]. Hence, this Galois group is {o9,...,%p—1} where 
o;(8;) = GG, j =0,-..,p — 1, and is a cyclic group of order p. 

Hence, putting all of these extensions together we see that we 

have a subnormal series for G’ = Gal(E;,/F) of the form 


G=G) 2G, D...2G,=f{1} 


with G‘_,/G', cyclic for each i = 1,...,s8, and so G’ is solvable. 

Now E, a splitting field of f(x), is a Galois extension of F and 
E C E,. Thus, Gal(E/F) = G’/Gal(E},/E) is a quotient of a solvable 
group, and so is solvable as well. 


Remark 4.15.4. The converse of Theorem 4.15.3 is true as well, but 
we shall not prove this here. © 


Corollary 4.15.5. The general equation f(x) = 0 of degree greater 
than or equal to five is not solvable by radicals. 


Proof. For n > 5, let F be the field of symmetric functions 
F = Fo(s1,...,5n) in the variables x71,...,%p,Fo an arbitrary field 
of characteristic 0, and let E be the splitting field of the polyno- 
mial f(x) = (w©—21)...(a@ — a) € Fla]. Then Gal(E/F) is isomor- 
phic to the symmetric group S,, and S;, is not a solvable group for 
n>od. 


Remark 4.15.6. Up to sign, s1,...,5, are the coefficients of the 
polynomial f(a) above, so this shows there is no general formula for 
the roots of a polynomial of degree at least five in terms of radical 
expressions in its coefficients. 

But this leaves open the possibility that for a general polynomial 
f(x) € Q{a], it may be possible to express its roots in terms of radical 
expressions in its coefficients, even if there is no general formula for 
doing so. This is also impossible, as we now see. © 
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Corollary 4.15.7. The general equation f(x) =0 with f(x) € Q{z] 
a polynomial of degree greater than or equal to five is not solvable by 
radicals. 


Proof. Let f(x) have degree n. If n = p is prime, we exhibited in 
Example 4.11.19 a polynomial f(x) € Q|z] of degree p with Gal(E/Q) 
isomorphic to the symmetric group S,, where E is a splitting field of 
f(x). This is in fact true for n any positive integer, though we will 
not prove this here. 


4.16 Straightedge and compass constructions 


In this section, we consider the question, which goes back to Euclid, of 
which geometric constructions can be performed by straightedge and 
compass. To answer this question we first translate it into algebraic 
terms, and then apply the field theory we have developed to first 
show that the three classical problems of antiquity — trisecting the 
angle, doubling the cube, and squaring the circle — cannot be solved 
by straightedge and compass constructions, and then to determine 
exactly which regular polygons can be constructed by straightedge 
and compass (modulo a question in number theory). 

We begin by observing that we can certainly draw a line L in the 
plane just using a straightedge. We do so, and pick two distinct points 
O and P on L. We declare O to be the origin and the distance from O 
to P to be 1. Then we may construct the line L’ that is perpendicular 
to L at O by straightedge and compass, use a compass to draw the 
circle of radius OP centered at O, and let P’ be a point at which 
this circle intersects L’. 

We then declare P to be the point with coordinates (1,0), and P’ 
to be the point with coordinates (0,1), so that L is the x-axis and L’ 
is the y-axis. This gives coordinates on the plane. 

We next observe that we can perform the four basic arithmetic 
operations — addition, subtraction, multiplication, and division — 
by straightedge and compass. We can certainly add and subtract 
lengths. We may multiply and divide lengths as follows: Given a 
length x and a length y, draw two intersecting lines Ly and Ly. Let 
their intersection point be O,. Mark off a distance O; P; on Ly equal 
to x and a distance OP; on L2 equal to y. Mark off a distance O;Q, 
on Ly equal to 1, and draw the straight line Q; Py. Then draw the line 
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through P,; parallel to this straight line, and let this line intersect Lo 
at the point Q2. Then, by similar triangles, the line segment O,Q2 
has length xy. Alternatively, draw the straight line P; Po, and then 
draw the line through Q, parallel to this straight line, and let this 
line intersect L2 at the point Q3. Then, by similar triangles, the line 
segment O,Q3 has length y/z. 

We may also take the square root of a positive real number x as 
follows: Draw a line L through a point O, and let P, be a point on 
L at a distance x from O,. Let Pp be a point on LZ on the other side 
of O; from P, at a distance of 1 from O,. Let C be the midpoint 
of the line segment from P; to P2 and draw a semicircle centered 
at C passing through P; and P3. Construct the line perpendicular 
to the line LZ at Oj; and let it intersect this semicircle at the point 
Q. Then P,QP> is a triangle inscribed in a semicircle, so is a right 
triangle, and then the triangles Q; P,Q and O,QP» are similar. Let 
w be the length of the line segment O,Q. Then, by similar triangles, 
1/w = w/z and sow = /Z. 

Finally, let us identify the point with coordinates (x,y) with the 
complex number z = x + iy. Since addition, subtraction, multiplica- 
tion, and division of complex numbers may be performed in terms 
of these operations on their real and imaginary parts, we may per- 
form all of these operations by straightedge and compass construc- 
tions. Furthermore, we may take square roots of complex numbers 
by straightedge and compass: Write the complex number w in polar 
form as w = re’. Then z has a square root Jre'®/ 2. We have already 
seen we can find ,/r, and we can bisect angles as well. 


Theorem 4.16.1. The complex number z can be constructed by 
straightedge and compass if and only if there is a sequence of fields 


Q=Fo CF, Cc-::- CF, 


with z € Fx, or, equivalently, with Q(z) C Fx, and with (F;/F;-1) = 2 
fori=1,...,k. 


Proof. First suppose there is such a sequence of fields. We proceed 
by induction on 7. If 2 = 0, every element of Q can be constructed, 
as we have shown. Now suppose every element of F;_1; can be con- 
structed. Since (F;/F;_1) = 2, we see from the quadratic formula that 
F; = F;-1(,/%1) for some 2_1 € F;_1. But we have seen that we 
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may construct \/w for any complex number w so we may construct 
/%-—1 and then every element of F;. 

Conversely, suppose that z can be constructed by straightedge 
and compass. Then z is obtained by a sequence of the following 
operations: 


(1) Finding the intersection of two lines. 
(2) Finding the intersection of a line and a circle. 
(3) Finding the intersection of two circles. 


Suppose at any stage of the process the points we are dealing with 
all lie in a field F. (At the start, F = Q.) In case of operation (1), 
both lines are given by linear equations with coefficients in F, so their 
intersection is a point in F. In case of operation (2), the line is given 
by a linear equation with coefficients in F, and the circle is given by 
a quadratic equation with coefficients in F, so their intersection is a 
point in F’, where either F’ = F or (F’/F) = 2. In case of operation 
(3), each circle is given by a quadratic equation with coefficients in 
F, but by subtracting a multiple of one of these equations from the 
other we may obtain a system consisting of one linear equation and 
one quadratic equation. 

Thus, performing these operations repeatedly to obtain the com- 
plex number z shows that z is an element of a field F, obtained as a 
sequence of quadratic extensions. 


Theorem 4.16.2. It is impossible to perform each of the following 
by straightedge and compass constructions: 


(1) Trisecting the angle 
(2) Doubling the cube 
(3) Squaring the circle 


Proof. 


(1) We may find an angle y if and only if we can find the point A 
that is the intersection of the line making an angle of y with the 
positive x-axis at the origin with the unit circle. That point A 
represents the complex number z = e’? = cos(y)+isin(y). Thus, 
we can construct A by straightedge and compass if and only if 
we can construct cos(y) and sin(y). Now recall the triple-angle 
formula: cos(3y) = 4cos?(y) — 3 cos(y). Setting y = 6/3, we see 
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4cos? (8/3) — 3cos(0/3) = cos(@). Now let 6 = 7/3, an angle we 
certainly can construct. Then cos(@) = 1/2, so that cos(@/3) is 
a root of the cubic polynomial 4x3 — 3x = 1/2, or equivalently 
82> — 62 — 1 = 0. But this polynomial has no linear factor, so is 
irreducible over Q, and hence (Q(cos(7/9))/Q) = 3. But 3 does 
not divide 2" (for any k), so we cannot have Q(cos(m/9)) C Q(Fx) 
for any field F; as in Theorem 4.16.1. 

(2) Since we can find a cube of volume 1 (the cube whose sides 
all have length 1), doubling the cube means finding a cube of 
volume 2, and hence a cube with side length « = V2 is a root of 
the irreducible polynomial «* — 2 and this is again impossible as 
i. (1). 

(3) Squaring the circle means finding a square that has the same area 
as a given circle. Beginning with a circle of radius 1, that means 
finding a square of side length \/7. But now we use a famous 
theorem of Lindemann, that 7 is transcendental, i.e., that 7 is not 
a root of any algebraic equation with coefficients in Q, so this is 
impossible. 


We now turn to the question of the constructibility of the regular 
n-gon by straightedge and compass. We see right away that this is 
equivalent to the constructibility of an angle of 27/n. 

Euclid knew how to construct a regular n-gon for n = 3,4, 5. 
If we can construct an angle of 27/n, and an angle of 27/n2, and 
n, and ng are relatively prime, then we can construct an angle of 
27r/n where n = nyn2, as follows: By Euclid’s algorithm we can 
write 1 = nya — ngb for some positive integers a and 6, and then 
1/(nin2) = (1/n2)a — (1/n1)b. So we construct a times an angle of 
2n/n2 and subtract from it b times an angle of 27/n 1 to obtain an 
angle of 27/n. Also, since we can bisect an angle, if we can construct 
an angle of 27/n, we can construct an angle of 27/(2*n), for any 
k. Thus, we can construct a regular n-gon for n = 2°3°5° with 
eg > 0, ey =O orl, eg =Oorl. 

There matters stood until Gauss showed how to construct a reg- 
ular 17-gon. This discovery, the first advance in this problem in 
over two millennia, played a role in Gauss’s decision to become a 
mathematician. 

We now go to work ourselves. 

We first reformulate the condition in Theorem 4.16.1. 
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Lemma 4.16.3. For a complex number z, the following are 
equivalent: 


(1) There is a sequence of fields 
Q=KRychc:-ck 


with z € Fp, or, equivalently, with Q(z) C Fx, and with 
( af Fea) = 2 fori=1,...,k. 

(2) If E is a splitting field of the minimal polynomial m,(x), then 
(E/Q) is a power of 2. 


Proof. (2) implies (1): Let G = Gal(E/Q). Then |G| is a power 
of 2, say |G| = 2*. But then by Lemma 2.9.6 there is a sequence of 
subgroups 


G=Gy > Gi >" 3 G;=—41} 


with |G;| = 2*-*, or, equivalently, [G;_1: Gj] = 2. Let F; = Fix(G)). 
(1) implies (2): We shall show that there is a sequence of fields 


Q= Eo CE, c-::: C Ey 


— 


with z € Ex, or equivalently with Q(z) C Ex, where each E; is a 
Galois extension of Q, F; C E;, and with (E;/E;_1) a power of 2, for 
each i. Then (Ex/Q) = (Ex/Ex—1)---(E1/Eo) is a power of 2, and 
E C Ex, so (E/Q) divides (E,/Q) and hence (E/Q) is a power of 2. 
We do so by induction on k. For i = 0 this is trivial, and for 
1 = 1 we may choose E, = Fy, as Fy is a quadratic extension of Q 
so F, is a splitting field of a polynomial x? — a, a € Q, and hence is 
automatically a Galois extension of Q. 
Now suppose this is true for i — 1. F; is a quadratic extension 
of F;-1, so, as we have just observed, F; is a splitting field of a 
polynomial f(x) = 2? — a for some a € F,;_; C E,_. Let 


g(x) = [J o( F(z) = []@? - o(a)) 


Co 


where the product is taken over all o € Gal(E;_;)/Q). Let E; bea 
splitting field of g;(x) € E;_1. But note that in fact g;(a) € Q/z], as 
gi(x) is invariant under Gal(E;_1/Q). Then E; is a Galois extension 
of F, as it is a splitting field of the polynomial g;_1(x)g;(x). It remains 
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to show that (E;/E,;_,) is a power of 2. To this end, let us denote 
the elements of Gal(E;_;/Q) by {o1,...,0;}, so that g;(x) = [] (x? — 
ai(a)). Let 8; € E; with 8? = o;(a), so that 


Eg = Bea Pi yaxns Pe) 


and so we have a sequence of fields 


Met Cea ( 81) © Ey (1, Ba) C +++ C By a Br, «.., By) = Be 


Let us examine each of these intermediate extensions 
Bp Pigs Peer) Spd (Diywda Bp 7) = BA Bigces 4 GQ). 


Either B; © Wy {Bivins Beat), in which case E; VBigssn5 By) = 
Ej-1(61,..., 8-1) and (Ej_1(61,...,8;)/Ei_1(61,...,8j-1)) = 1, 
or 6; ¢ Ey-1(61,...,8;-1). But in this latter case E;_1(61,..., 8;) 
is obtained from E;_1((1,...,8;-1) by adjoining 6;, which is a 
root of the quadratic polynomial x“ — o(a;), and so in this case 


(gg (Bt;.+ + 87) / Mea (Bi,.<587-1)) = 2. Thus, 
(E;/E,-1) = (Ey-1(61, ..-, Be) /Ei—1(61,---, Bi-1)) - - (Ex_1 (81) /E_-1) 


is a power of 2. 


In preparation for our main result, we do a bit of elementary 
number theory. We ask when n = 2‘ + 1 can be prime. We have the 
algebraic identity, valid for any odd a, 7+1 = (2+1)(x* 1 -— 2%? + 
---—a2+1). Thus, it t has an odd factor greater than 1, say t = rs 
with r odd greater than 1, then this identity yields 


Plas 1 SOF ei ee ae 7) 


and so 2‘ + 1 is composite. Thus, the only possible primes of this 
form are when t is a power of 2, t = 2" for some k. We let 


Fy = 2? +1. 
If Fy, is prime, it is called a Fermat prime. This terminology is due 


to the fact that Fermat believed that Fy is prime for every k. Indeed, 
Fo = 3, F, = 5, Fy = 17, Fz = 257, and Fy = 65537 are prime, but 
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Euler discovered that F5 = 4294967297 is divisible by 641 and so is 
composite. 


Theorem 4.16.4. A regular n-gon is constructible by straightedge 
and compass if and only if n is of the form 


n= 2", +++ Dj 
where p1,...,p; are distinct Fermat primes. 


Proof. As we have observed, constructing a regular n-gon is the 
same problem as constructing an angle of 27/n, and this is the same 
problem as constructing the complex number ¢,, = exp(27i/n). By 
Lemma 4.16.3 this is possible if and only if the splitting field E of the 
minimal polynomial m¢, (x) is an extension of Q of degree a power of 
2. But the polynomial mg, (x) is just the nth cyclotomic polynomial 
®,,(x), since ¢, is a root of this polynomial, and since, as we have 
shown, this polynomial is irreducible, and E = Q(¢,). Thus, 


(E/Q) = deg ®n(x) = y(n). 


Now factor n as a product of prime powers, n = 2p -+ +P,’ 
Then 


o(n) = 9(2")o(p?) ... e(p7) 
= (24-1) (pl (p, — 1))... (pf "(pj — 1). 


In order to have y(n) be a power of 2, it is necessary and sufficient 
that e] = --- =e; = 1, and that p; — 1 is a power of 2 for each j, ie., 
that p; is a Fermat prime for each i. 


Remark 4.16.5. We are almost totally ignorant about Fermat 
primes. There is no known value of & > 4 for which Fy is prime. 
And in our present state of knowledge any of the following alterna- 
tives may be true: There may be no values of k > 4 for which F;, 
is prime; there may be some but only finitely many values of k > 4 
for which F; is prime; there may be infinitely many values of k > 4 
for which F; is prime and infinitely many values of k > 4 for which 
Fy, is composite; there may be only finitely many values of k > 4 for 
which Fy; is composite and for all other values of Fy, with k > 4 F, 
is prime. We just don’t know. © 
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In this section, we prove the fundamental theorem of algebra: Every 
nonconstant complex polynomial has a complex root. 

But first let us note that, despite its name, there can be no purely 
algebraic proof of this theorem. To see why, let us consider the field 
of complex numbers C. How is C defined? It is C = R(i) where 
i? = —1. Thus C is an algebraic extension of R of degree two. But how 
is the field of real numbers R defined? The definition of R involves 
completeness (i.e., the analytical/topological notion of limits) and 
that is not an algebraic definition at all. So if we can’t even define R 
or C purely algebraically, we certainly can’t hope to prove anything 
about these fields purely algebraically. 

But what we can hope to do is to prove the fundamental theorem 
of algebra with only a minimal use of non-algebraic methods, and 
that is what we do here. We begin with a result from elementary 
calculus, and once we have that, we proceed purely algebraically. 

(We remark that there are purely analytic proofs of the funda- 
mental theorem of algebra — see any complex analysis textbook.) 

Here is the only non-algebraic fact we will use. 


Lemma 4.17.1. Let f(x) € R[x] be a polynomial of odd degree. Then 
f(x) has a real root. 


Proof. This is a familiar result from elementary calculus (to be 
fair, a result that is always stated in an elementary calculus course, 
but not proved until a more advanced course). 


Theorem 4.17.2 (Fundamental theorem of algebra). Let 
f(x) € Cla] be a nonconstant polynomial. Then f(x) splits into a 


product of linear factors in C[x], so that f(x) has n = deg f(x) roots 
in C. 


Proof. First we observe that it suffices to prove this if f(x) € 
R[x]. To see this, consider any polynomial g(x) € C[z]. Then f(x) = 
g(x)g(x) € Riz]. If f(x) splits in C[z], then so does g(x) (and g(z)). 
Thus, let f(z) € R[z]. Let E be a splitting field of f(x). Let 
(E/R) = 2d with d odd. Let G = Gal(E/R). Then |G| = 27d. 
Let H be the 2-Sylow subgroup of G, and let B be the fixed 
field of H. Then (B/R) = d. By the theorem of the primitive element 
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B = R(a) for some a, and then its minimal polynomial m,(x) € R[z] 
is a polynomial of odd degree d. But then by Lemma 4.17.1 ma(z) 
has a root a € R, ie., mq(z) is divisible by x—a in R[a]. Since ma (x) 
is irreducible, we must have mg(xz) = x—a,sod=1and B=R. 

Thus, (E/R) = 2™, and H = G. We prove the theorem by induc- 
tion on m. 

Before doing so, we observe that every complex number has a 
complex square root. If z € C, we can write down explicitly a complex 
number w with w? = z. If z = 0, then w = 0. Otherwise, let c = 
a/(a* + b*). Ifa > 0 and b > 0, then w = Va? + 62(,/(1 +c)/2+ 

(1 —c)/2), with similar formulas for other values of a and b. 

What we will actually prove is that if (E/R) = 2™, then E is 
isomorphic to a subfield of C. 

In case m = 1, this is trivial: E = R. 

Now suppose this claim is true for all extensions of R of degree 
2™-1 and let (E/R) = 2. Now G = Gal(E/R) has order 2”, i.e., is 
a 2-group. Choose an element go of G of order 2, and let Gp be the 
subgroup of G generated by Go. Let Eg = Fix(Go). Then (E/Eg) = 2. 

Now (E/R) = (E/Eo)(Eo/R) so (Ep/R) = 271, ie., Eo is an 
extension of R of degree 2™~1, so is isomorphic to a subfield of C, 
by the inductive hypothesis. Choose any isomorphism and let Eo be 
the image of E under this isomorphism. 

Then E is an extension of Eg of degree 2, so E = Eo(a) for some a 
with ma(x) a polynomial of degree 2 with coefficients in Eg. But, as 
we observed, every complex number has a complex square root, so, 
from the quadratic formula, we see that M(x), the image of ma(zx) 
under the isomorphism from Ep to Ep, has a root w € C, so E is 
isomorphic to Eo(w), a subfield of C. Then, by induction, we are 
done. - 

Finally, since f(x) splits in E, and E is isomorphic to Eo(w), f(x) 
splits in E(w), and hence in C. 


4.18 Exercises 


1. (a) Let f(z) = 2° + 32? + 62 i 3, an irreducible polynomial in 
Q[z]. Let E = Q{z]/ < 23 + 32? eT >. Let a be a 
root of f(x) in E. Let 6, = 2a +1, 62 =a? +4a+4 6. Find 
B?, 8182, and 3. Also, find the minimal polynomials mag, (z) 
and mzg,(a) and find By! and By". 


(b) 


(e) 
. (a) 


(b) 
(c) 


- (a) 
(b) 
(c) 
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Let g(x) = 2* + 4x? + 22 + 2, an irreducible polynomial in 
Q|z]. Let E = Q[z]/ < x++42?+2r+2 >. Let a be a root of 
g(x) in E. Let 6, = a+2, Bo = a? +2a+5, 63 = a%+207+4+1. 
Find 62, 62, 63, 8182, 0103, and B2{3. Also, find the minimal 
polynomials mg, (x), mg,(x), mg,(x) and find B,', B+, and 


Let f(x) = a? + 3x? + 42 + 3, an irreducible polynomial in 
F5[z]. Let E = Fs[z]/ < 2? + 322+ 4r+3 >. Let a bea 
root of f(x) in E. Let 6; = 3a + 2, 62 = a? + 4a 4 2. Find 
B?, 8182, and 3. Also, find the minimal polynomials mag, (z) 
and mg,(z), and find 6,;+ and 5". 

Let g(x) = 244+ 234 3x? + 22 +2, an irreducible polynomial 
in F5[z]. Let E = Fs/[2]/ < a4 +22 +32? + 2242 >. Let 
a be a root of g(x) in E. Let 6) =a4+ 3, Bo =a +2a44, 
63 = a° +2a+1. Find Sf, 3, 63, 8152, 6163, and 6283. Also, 
find the minimal polynomials mg, (x), mg,(x), mg,(x) and 
find 6, ° 2, and 6°: 


Write down the addition and multiplication tables in F7. 
Write down the addition and multiplication tables in Fg. 
Write down the addition and multiplication tables in Fo. 


Factor the polynomial «4 — x into irreducibles in Fo[z]. 
Factor the polynomial «° — x into irreducibles in Fo[z]. 
Factor the polynomial 21° — x into irreducibles in Fa[z]. 
Factor the polynomial «° — x into irreducibles in F3[z]. 
Factor the polynomial x!° — x into irreducibles in F4[z]. 
Let f(x) = 2° +a +1, an irreducible polynomial in Fs[z]. 
Let E; = Fs[z]/ < f(x) > and let a be a root of f(x) in Ey. 
Find all the roots of f(a) in Ej. 
Let g(x) = a3 + x? 4+ 2, an irreducible polynomial in Fs[z]. 
Let Ey = Fs[2]/ < g(x) > and let 8 be a root of g(x) in Eo. 
Find all the roots of g(x) in Eg. 

The fields E, and Eg are isomorphic. Find an explicit iso- 
morphism y: E, > Eg. 

Factor f(x) = x° — 1 into irreducibles in F3[s]. 

Factor f(x) = x? — 1 into irreducibles in Fs[z]. 

Factor f(x) = x° — 1 into irreducibles in F7[z] 


. Let p and q be primes (not necessarily distinct). Show that the 
number of irreducible polynomials of degree q in F,[z] is (p? — 


P)/4- 
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Suppose that f(a) = 279 — x € Fs3[z] is factored into a product 
of irreducibles f(x) = gi(x)...g%(x) in F3[x]. For each positive 
integer d, how many of the polynomials g;(x) are of degree d? 


. Let f(x) be a polynomial of degree 15 in F,[x] and let E be a 


splitting field of f(x). Find all possible values of (E/F). 
(a) In each case, find the minimal polynomial mq(x) € Q[z]: 


(1) a= V27+10V3 
(2) a= 7/25 +22,/2 
(3) a= V-154 1073 
(4) a= V—-154+ 1272 


(b) In each case, find the minimal polynomial mag(x) € Fy7{z] 
(where square roots and cube roots are to be interpreted as 
taken in some extension field of F7): 


— 


Let p(x) € Fla] be an irreducible monic polynomial of degree n, 
and let C be the companion matrix of p(x) in R = M,,(F), the 
ring of n-by-n matrices with entries in F. Let 


n—-1 
EK = {Sac | ane | 
i=0 


(a) Show that E is a subring of R. 
(b) Show that E is field. 
(c) Show that the polynomial p(x) has a root in E. 


(Thus, E gives a “concrete” realization of the field F[x]/ < p(x) > 

of Kronecker’s theorem, Theorem 4.2.4.) 

Let f(x), g(x) € Fa] be monic irreducible polynomials. If f(x) 

and g(x) have a common root in some extension field E of F, 

show that f(x) = g(x). 

(a) Let E be an extension of F and let a,@ € E. Suppose that 
a+ 6 is algebraic over F of degree j and that @{ is algebraic 
over F of degree k. Show that each of a and £ is algebraic 
over F of degree at most 27k. 


14. 


15. 


16. 


17. 


18. 
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(b) Give an example where each of a and £ is algebraic over F 
of degree exactly 27k. 

(a) Let E be an extension of F of prime degree p and suppose that 

EK = F(a). Let g(x) € Fla] be any nonconstant polynomial 

of degree d < p and let 3 = g(a). Show that E = E({). 

(b) Let E be an extension of F of odd degree and suppose that 

KE = F(a). Let g(x) € F[a] be any quadratic polynomial and 
let 8 = g(a). Show that E = F(). 

(a) Let f(a) € Fla] be an irreducible polynomial of degree d. 
Let E be an extension of F with (E/F) relatively prime to d. 
Show that f(z) is irreducible in E[z’]. 

(b) Let @ and 8 be elements of E, an extension field of F, with 
minimal polynomials ma(x), mg(a) € Fla] of degrees s and 
t respectively. Let mg(a) € Bla] be the minimal polynomial 
of 8 over B = F(q). If s and ¢ are relatively prime, show that 
™g(x) = ma(z). 

(a) Solve the system of equations: 


(1) in the field of complex numbers C. 
(2) in the field F37, the finite field with 37 elements. 


(b) Solve the system of equations: 


g+y +z =7 
x +y? +27 =15 


a +y? +29 = 31 


(1) an. 

(2) in F37. 
Let p be a prime. Show that s,(1,2,,...,p—1) =0 (mod p) for 
k =1,...,p—2 and that s,_1(1,2,,...,p— 1) = —1 (mod p). 


Here sz is the kth elementary symmetric function. 

Find the Galois groups of the following polynomials over Q: 
(a) (x? — 3)(x° — 1) 

(b) (2? +3)(x° — 1) 
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(c) (x3 —1)(23 - 2) 
(d) (2° — 2)(a° — 3) 
(e) e*-—4 
(f) at+4 
(g) «®—2 
(h) 2® —3 


Let a € Q and suppose that a # +b? for any b € Q. Let E 


be a splitting field of the polynomial x4 — a € Q{z]. Show that 
Gal(E/Q) is isomorphic to Dg. (Compare Example 4.8.7.) 

Let E be a splitting field of a polynomial f(x) € Q|z] of degree 
4. Suppose that (E/Q) = 8. Show that f(x) is irreducible and 


that Gal(E/Q) is nonabelian. 


(a) Let n be a positive integer and let F be a field that contains 
a primitive nth root of 1, i.e., an element a with a” = 1 but 
™ 4 1 for any 1 < m <n. (For example, we could choose 


= Q(¢,).) Let E be a Galois extension of degree n with 
Gal(E/F) a cyclic group. If n = p is prime, show that E = 
(8) for some element (6 of E with 6” € F. (Thus if b = 8”, 
E is a splitting field of the polynomial f(x) = 2” —b € Fiz.) 


This result is true for any positive integer n, but is more 


difficult to prove in general. 


(b) Let % = mea) Bi € E. Show that {7,... 


mal basis for E over F. 


»Yn—1} is a nor- 


Let p be a prime and let F = F,. Let a be an integer relatively 
prime to p. Let f(x) € F[z] be the polynomial f(x) = 2? —x—-a. 


(a) If @ is a root of f(x) in an extension field E of F, factor f(x) 


in E. 
(b) Show that f(a) is irreducible in F{z]. 
(c) Let E be a splitting field of f(x). Then 


F of degree p. Describe the action of Gal(E/F) on E. 


E is an extension of 


(d) More generally, let F be a field of characteristic p and let 
f(x) = x? —x— a for some a € F. Suppose that f(x) does 
not have a root in F. Do parts (a), (b), and (c) in this more 


general situation. 


The field E is known as an Artin-Schreier extension of F. 
(e) Let F = F,, let a be an integer relatively prime to p, and let 
f(x) =a? — a —a for some positive integer n. Let B = Fyn 


23. 


24. 


25. 


26. 


Zt. 
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and let E be a splitting field of f(x). Show that (E/B) = p. 
Describe the action of Gal(E/F) on E and identify the sub- 
group Gal(E/B) of Gal(E/F). 


Let 
be t 


F = F,. Let k be an integer with k # 0,1 (mod p). Let d 
he order of k (mod p), i.e., the smallest positive integer d 


such that kt = 1 (mod p). Let f(x) be the polynomial f(x) = 
x? —kx —1 € F{[z]. Show that f(x) is the product of a linear 
factor and (p — 1)/d distinct irreducible factors of degree d in 


(a) 


(b) 


Let E be a Galois extension of F of degree n. 

If n is not a prime, show that there is some field B strictly 
intermediate between F and Ei.e., with FC BCE. 
Give an example of the following situation: E is a Galois 
extension of F of degree n, d is a divisor of n, but there is no 
field B intermediate between F and E that is an extension of 
F of degree d. 


Let d be an arbitrary positive integer. Show that there is a Galois 


exte 


(a) 


Let 
HK is 


nsion E of Q of degree d. 

Let G be a finite abelian group. Show that for some N, there 
is a field intermediate between Q and Q(¢n) with Gal(E/Q) 
isomorphic to G. In your solution, you may use Dirichlet’s 
famous theorem: Let a and b be relatively prime. Then there 
are infinitely many primes congruent to b (mod a). (Note 
that this implies the result of the preceding problem.) 

If G is not abelian, show that there is no such N. 

The Kronecker—-Weber theorem states that any Galois exten- 
sion of Q with abelian Galois group is isomorphic to a sub- 
field of some cyclotomic field (i.e., to a subfield of Q(¢y) for 
some N). 

KE = Q(¢7) where ¢7 is a primitive 7th root of 1. Note that 
an extension of Q of degree 6. Note also that E contains 


a unique subfield B that is an extension of Q of degree 2 and 


that 


degree 3. 


(a) 
(b) 


E contains a unique subfield D that is an extension of Q of 


Find an irreducible polynomial f(x) € Q[a] such that B = 


Q(G) where £ is a root of f(x). 
Find an irreducible polynomial g(x) € Q|a] such that D = 


Q(d) where 6 is a root of g(x). 
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28. Let D > 1 be a square-free positive integer, and let a = VD +i. 
Let ma(x) € Q|z] be the minimal polynomial of a, and let E 
be a splitting field of ma(x). For which positive integers n is 
Q(¢n) C E? (Of course, your answer will depend on D.) 

29. (a) Let p be a prime and let (k/p) be the Legendre symbol. Let 


p-1 
Sp = >_(k/p)exp(2nik/p). 
k=1 
Show that (S,)* = (—1/p)p. Thus S, = +,/p if p = 1 (mod 
4) and Sp, = +i,/p if p = 3 (mod 4). (The sum Sy is an 
example of a Gauss sum. It is a theorem of Gauss than in 
each case the sign is +. Gauss wrote that after thinking about 
this problem fruitlessly for a year, the solution came to him 
“like a bolt of lightning from the sky”.) 
(b) Let D be an arbitrary integer. Show that Q(WD) C Q(q,) 
for some n. 
(c) Let p be an odd prime and let D be an integer that is not a 
perfect pth power. Show that Q(W/D) ¢ Q(¢,) for any n. 
30. (a) Let p be a prime and let n be a positive integer. Show that 


oP” —1 
ger a 
(b) Let n be a positive odd integer. Show that 

®2,(x) = €®,(—2) 


®pn(x) = 


where € =—lifn=lande=-+1lifn>1. 

(c) Let p and q be distinct primes. Show that 
(xP4 — 1)(x — 1) 
® = 
pq(2) (xP — 1)(a4 _ 1) 


(d) More generally, let p be a prime and let n be a positive integer 
not divisible by p. Show that 


Onp() = On(2?)/On(a). 


(e) Let n be a positive integer and let k be the product of the 
distinct prime factors of n. Show that 


®,(x) = &,(x"/*). 


dl. 


32. 


33. 


34. 


35. 


36. 


37, 
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Let F = F, be the field with p elements and let m be an integer 
relatively prime to p. Let ®,,(a) € F[a] be the (mod p) reduction 
of ®,,(x), and let g(x) =a” —1€ Fiz]. 


(a) Show that ®,,(a) and g(a ) have the same splitting field E, 
and furthermore E = F,, where r is the smallest positive 
integer with p’ = 1 (mod m). (In particular, while ®,,(x) 
is always irreducible in Q[2], ®,,(2) is irreducible in F,[2] if 
and only if r = y(m).) 

(b) Let ©,,(a) = fi(x)... fy (x) be a factorization of ®,,(x) into 
a product of irreducible polynomials in F [x]. Show that each 
polynomial f;(a) has degree r, and hence that k = y(m)/r. 


Let f(x) € Fiz] have roots {a1,...,a,} in some splitting field 
EX. Show that f(x) is irreducible in F[a] if and only if 


[I (@ - 0) ¢ Fla] 


a,EeT 


for any nonempty proper subset T of S = {aj,...,Qn}. 
Let G be a finite group of automorphisms of a field E and let 
‘C E be its fixed field. Let a € E and let {a, =a,...,a,} be 
its orbit under the action of G. Show that a is dlecbreie over F of 
degree r with minimal polynomial mg(x) = (w—a)...(x—ary) € 
Let n > 2. Let E = Q(G,) and B = ENR. Show that (B/Q) = 
p(n) /2 and that ee = 2. Show that B = Q(cos(27/n)) and 
that E = B(i sin(27/n)). 
(a) For n = 7,8,9,10 find the minimal polynomial of cos(27/n) 
over Q. 
(b) For any n, find the minimal polynomial of i sin(27/n) over 
Q(cos(27/n)). 
For n = 7,8,9,10, let E = Q(G,). Find all fields intermediate 
between Q and E. For each intermediate field B, find Gal(B/Q) 
and Gal(E/B). Find a primitive element of B. Also, find a poly- 
nomial f(x) € Q|z] whose splitting field in B. 
For n = 7,8,9,10, let E be a splitting field of f(z) = x? — 
7 € Qa]. Find all fields intermediate between Q and E. For 
each intermediate field B, find Gal(B/Q) and Gal(E/B). Find a 
primitive element of B. Also, if B is a Galois extension of Q, find 
a polynomial g(x) € Q|z] whose splitting field is 
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In each case, let m_(x) denote the minimal polynomial of a over 
Q, and let E be a splitting field of ma(x). Let G = Gal(E/Q). 


1. (a) Let a = 735+ 10/10. Show that a = c+ Vd for some 
c,d € Q. Show that m,(2) is a quadratic, that (E/Q) = 2, 
and that G is isomorphic to Zo. 

(b) Let a = 20+ 10/3. Show that a = e+ ./f for some 
e,f € Q Show that ma(z) is a quartic, that (E/Q) = 4, 
and that G is isomorphic to Z2 @ Zg. 

2. (a) Let a = 54+ V5. Show that ma(zx) is a quartic, that 

(E/Q) = 4, and that G is isomorphic to Z4. 

(b) Let a = V104+ V5. Show that mo(z) is a quartic, that 

(E/Q) = 8, and that G is isomorphic to Dg. 


(Note that this problem is a special case of the next problem.) 

Let a = Va+ Vb, a,b, € Q, b not a perfect square. Let ma(x) be 
the minimal polynomial of a, ma(x) € Q|[z]. Let E be a splitting 
field of ma(x) over Q, and let G = Gal(E/Q). Show the following: 


(1) Suppose that a? — b is a perfect square in Q, a? — b = q’. 
(Note q is only defined up to sign) 


(a) If 2a+ 2q is a perfect square (for some choice of q), them 
M(x) is a quadratic, (E/Q) = 2, and G is isomorphic 
to Zy. (In this case, a = (c+ Vd)? for some c,d € Q.) 

(b) If 2a + 2q is not a perfect square (for any choice of q), 
then mq(x) is a quartic, (E/Q) = 4, and G is isomorphic 
to Z2®@Zz. (In this case, a = \/e+\/f for some e, f € Q.) 


(2) Suppose that a? —6 is not a perfect square in Q. Then ma(z) 
is a quartic. 

(a) If (a? — b)/b is a perfect square, then (E/Q) = 4 and G 

is isomorphic to Z4. 

(b) If (a? —b)/b is not a perfect square, then (E/Q) = 8 and 

G is isomorphic to Dg. 


Consider the polynomial f(x) = a4 + sx? +t € Q|a]. Use the 
preceding exercise to determine when f(z) is irreducible. In this 
case, let E be a splitting field of Q|z]. Determine (E/Q) and 
G = Gal(E/Q). 


Al. 


42. 


43. 


AA, 


45. 


46. 
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Let E be an extension of Q of degree 4. Show that there is a 
quadratic extension B of Q with B C E if and only if E = Q(a) 
where a is a root of an irreducible polynomial f(x) = 24+ sa? + 
t € Qlz]. 

Let E be a splitting field of the separable polynomial f(x) € F[z], 
and suppose that the group G = Gal(E/F) has no nontrivial 
proper normal subgroups. Show that E is a splitting field of some 
irreducible factor of f(a). Give a counterexample if G does not 
satisfy this condition. 

Let f(x) € Fla] be a separable irreducible polynomial of prime 
degree p, and let E be a splitting field of f(x). Let G = Gal(E/F). 
If G is not isomorphic to Z,, show that G is not abelian. 

Let p(x) € Fla] be an irreducible polynomial, and let E be a 
splitting field of p(a). Let a and { be any two roots of p(x) in 
EK. Let f(a) € F[z] be an arbitrary polynomial, and suppose that 
f(x) factors in E[z] as f(x) = g(x, a)h(z, a) for some polynomi- 
als g(x,y) and h(x, y) in F[z,y]. Show that f(x) also factors in 
Ela] as f(x) = g(x, B)A(a, B). 

Let B and D be any two extensions of F such that (BD)/F = 
(B/F)(D/F). Let B, be any field intermediate between F and 
and let D, be any field intermediate between F and D. Show that 
(B.D, /F) = (B1/F)(D1/F). 

Let B and D be disjoint Galois extensions of F. Let B be a 
splitting field of f(x) € Fla] and let D be a splitting field of 
g(x) € Flax]. Let 6 be a root of f(x) in B and let 6 be a root of 
g(x) in D. Show that 6+ 6 is a primitive element of F(,6). 
Let p and q be distinct primes, and let D be an integer that is 
neither a pth power nor a qth power. Show that the polynomial 
f(x) = xP? — D is irreducible in Q[z]. (Thus, for example, the 
polynomial x° — 72 is irreducible in Q[z]. Note that this polyno- 
mial does not satisfy the hypotheses of Eisenstein’s criterion.) 


An extension B of F is an abelian extension if it is a Galois 
extension of F whose Galois group Gal(B/F) is an abelian group. 


(a) Let E be a finite Galois extension of F. Show that there is 
a maximal abelian subextension Egy of F, i.e., that there is 
an abelian extension E,, of F with E,, C E, and if B is any 
abelian extension of F with B C E, then B C E, 

(b) If G = Gal(E/F) and G,, = Gal(E -b/E), identity Gabe 
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(c) If G is isomorphic to the alternating group A,, for n > 5, 


show that E,, = F. 


(d) IfG is isomorphic to the symmetric group S;,, for n > 5, show 
that (E,,/F) = 2. In this situation, suppose that char(F) 4 2 
and E is a splitting field of an irreducible separable polyno- 


mial f(x) € Fla]. Show that 


discriminant of the 


Let E be a Galois extension of F and let 


polynomial f(z). 


mediate between F and 


intermediate between B and 


(i.e., D is a Galois extension of F and if 


/ 


sion of F intermediate between B and E, then D C D’.) 


Gal(E/D) and Gal(D/F) 
Let f(a) € F{z] be an irreducible polynomial. Let 


Eab = F(VA), where A is the 


be any field inter- 
E. Show that there is a smallest field 
E that is a Galois extension of F 
is any Galois exten- 


Identify 


E be an exten- 


sion of F and suppose that f(x) factors as a product of irreducible 
polynomials f(x) = fi(x) ... f(x) in E[z]. 


(a) If E is a Galois extension of F, show that for each 7,7 there 


is an element oj; € 


(b) Give a counterexample to this if 


of F. 


Gal(E/F) such that oj; (fi(x)) = 


fj(2). 


E is not a Galois extension 


Let E be a splitting field of the irreducible separable polyno- 


mial f(x) € Fla], and let 


intermediate between F and 
a factorization of f(a) into irreducibles in 
polynomial has the same degree d. Furthermore, if a € 
root of f(a), show that d = (B(a) 


be a Galois extension of F that is 


B. Let f(e) = file) «+ Sele) be 
[x]. Show that each 


His any 


Ss 


) and that k = (BNF(a)/F). 


Let f(x) € F[z] be an irreducible separable polynomial of degree 
d, and let E be a splitting field of f(x). 


(a) For a root a of f(x) in E, let r(a@) be the number of 
f(x) in F(a). Show that r(qa) is independent of the choice of 
a. Call this common value r. 
(b) Let s be the number of distinct fields F(a), where a is a root 
of f(x) in E. Show that rs = d. 
(c) Give examples where r = 1, 1 < r < d, and r = d. (Choose 


ais 


roots of 


Let E be a splitting field of the irreducible separable polynomial 


f(x) € Fla]. Let d = ( 


K/F). For any root a of f(x) in 


E, and 
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any prime p dividing d, show that there is a field B intermediate 
between F and E with (E/B) = p and E = B(a). 

. Let F C B CE with E a Galois extension of F. Show that there 
exist irreducible polynomials f(x) € F[z] and g(x) € B[z] such 
that E is a splitting field of f(x), and that E is a splitting field 
of g(x), and g(x) divides f(x) in B[z]. 
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Chapter 5 


Rings of Algebraic Integers and 
Dedekind Rings 


In our investigation of rings, one of our principal interests has been 
the question of unique factorization. We have seen examples of rings 
that do have unique factorization, and examples of rings that do not. 

Our objective in this chapter is to show that in a very important 
class of examples, rings of algebraic integers, while we may not have 
unique factorization of elements, we always have unique factorization 
of ideals. 

We shall proceed in several stages. We will first define and study 
rings of algebraic integers. We will next define Dedekind rings, and 
show that rings of algebraic integers are always Dedekind rings. Then 
we shall show that we always have unique factorization of ideals in 
Dedekind rings. Then we will look at a bunch of examples. 

As you will see, field theory in general, and Galois theory in par- 
ticular, plays an essential role in our investigations here. 


5.1 Rings of algebraic integers 


Definition 5.1.1. An algebraic number field F is a finite extension 


of Q. v) 


Remark 5.1.2. As we have seen in Theorem 4.5.6, an algebraic 
number field is obtained from Q by adjoining a finite number of 
elements, each of which is algebraic over Q. In this chapter, we shall 
use algebraic to mean algebraic over Q. © 
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Definition 5.1.3. Let F be an algebraic number field. An element 
a of F is integral over Z, or is an algebraic integer, if the minimal 
polynomial mq(xz) € Q/z] of a over Q is a polynomial with coefficients 
in Z. We set Or = {a € F | a is an algebraic integer}. 

Or is the ring of algebraic integers in F. © 


Lemma 5.1.4. An element a of an algebraic number field F is an 
algebraic integer if and only if it is a root of some monic polynomial 


f(x) € Zia}. 


Proof. Ifma(x) € Z[z] then we may choose f(x) = mq(x). On the 
other hand, if a@ is a root of f(x) € Z[z] then ma(x) € Q/x] divides 
f(x), so Mq(x) € Zax] by Gauss’s lemma. 


Corollary 5.1.5. Let F=Q. Then a € Q is an algebraic integer if 
and only if a € Z, 1.e., Og = Z. 


Proof. The element a € Q has minimal polynomial m,(x) = x—a, 
and mq(x) € Z[x] if and only if a € Z. 


Remark 5.1.6. We will need to be careful in distinguishing between 
the integers (i.e., algebraic integers) in an algebraic number field 
F and the integers in Q. Thus, whenever there is the possibil- 
ity of confusion, we will refer to a € Z as a rational integer 
(i.e., an integer in the field of rational numbers). This is standard 
terminology. © 


Definition 5.1.7. In this chapter, we will refer to an abelian group 
as a Z-module. © 


Remark 5.1.8. There is a vast, and important, theory of modules 
over general rings. We do not want to get into that here, as we do 
not need it for our purposes. But we at least want to introduce the 
language, so you will become familiar with it. © 


By definition, Og is a subset of F. Our first job is to show that 
Or is a subring of F. In order to do so we develop a criterion for an 
element of F to be an algebraic integer. 


Definition 5.1.9. Let aj,...,@, be arbitrary elements of 
F. ‘Then ZloyscaceQy) = (play <:<5 Oy) | Big oeestty) SE 
Tl B4 5 «:8ee5¢ By | PST ) 
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Lemma 5.1.10. Let a © F. Then a is an integer in F if and only if 
Zla] is a finitely generated Z-module. 


Proof. Suppose a is an integer in F. Let its minimal polynomial 
Ma(x) € Zax] have degree n. Let 8 € Zia] so that 8 = p(a) for some 
polynomial p(x) € Z[x]. Then m,(x) divides p(x) in Q|z], and hence 
in Z|x], by Gauss’s lemma. Thus, 


p(x) = ma(x)q(x)+r(x), r(x) € Z[x] with r(x) =0 or deg r(x) <n, 


and so 8 = p(a) = ma(a)q(a) + r(a) = 0g(a) + r(a) = r(a). Thus, 
we see that Z[z] is generated by l,a,..., a”. 

On the other hand, suppose that Z[{] is generated by a finite set 
of elements {(1, ..., 8,}. Let 6; = pi(a), with p;(x) € Z[z] a polyno- 
mial of degree d;, for each i = 1, ..., k. Choose n > max(dj, ..., dx). 
Then a” € Z[a], so a” = S*_,m;8; = S*_,mip;(a) for some integers 
m1, ..-, Mx. Thus, if f(x) = 2” — U7 ymipi (x), then f(x) € Z[z] isa 
monic polynomial with integer coefficients of degree n with f(a) = 0. 
Thus, by Lemma 5.1.4, @ is an integer. 


Lemma 5.1.11. Let F be a field intermediate between Q and E. Then 
Og = FOR. 


Proof. Clear from Definition 5.1.3, as ma(x) is the same whether 
a is regarded as an element of F or an element of E. 


Theorem 5.1.12. Let F be an algebraic number field. Then Op is a 
ring with 1. 


Proof. mo(x) = x so 0 € Or and mji(x) = x —1s01 € Of. If 
Ma(x) = UPpaiz* then m_.(x) = U™%p(-1)"*az' so if a € OF 
then —a € Og. 

Now, let a,8 € Of. Then, by Lemma 5.10, Za] is generated 


by a finite set {a1,..., ax} and Z|G] is generated by a finite set 
{1, -.-, Be}. Then Zla, 8] is generated by the finite set {a;8; | i = 
1,...,k, 7=1,..., 2}, as any element of Zla, §] is a sum of product 


of polynomials in a and 6, each of which can be expressed in terms 
of these two sets. 

Thus Z[a, (] is a finitely generated Z-module, and it is certainly 
torsion-free as F is a field of characteristic 0. Thus, by Theorem 
2.6.32, it is a free Z-module. 
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Now Zia + 6] C Zia, 8] and Zia] C Zia, 6], so each of these 
is a free finitely-generated Z-module by Corollary 2.6.33, and hence, 
by Lemma 5.1.10, a+ 8 € Op and af € Of, and so Of is a ring 
with 1. 


Example 5.1.13. Let D be a squarefree integer, D 4 1, and let 
F = Q(VD), a quadratic extension of Q. Let O(WD) be the ring 
introduced in Example 3.1.15, 

a+b/D|a,b€Z} D=2,3 (mod 4) 


O(VD) = 
(VD) abv | a,b € Z,a=b (mod 2)} D=1 (mod 4). 


Then O(VD) = Og. © 


Let F be an extension of Q of degree n. Our next goal is to prove 
that Of is a free Z-module of rank n. This will take some work. 


Definition 5.1.14. Let E be a finite Galois extension of Q, with 
Galois group G = Gal(E/Q). 
For a € E, we let 


T(q@) is called the trace of a. > 


Remark 5.1.15. The trace is actually defined more generally. If E 
is a Galois extension of F with Galois group G then trg/p(a) = 
Yeego(a). But we will not need this more general definition 
here. rv) 


Lemma 5.1.16. Let E be a finite Galois extension of Q and let 
aé€E,a#0. Then there is an element 8 € E with T(a’) £0. 


Proof. Let a have degree n and let aj = a, ..., My be the Galois 
conjugates of a, i.e., the distinct elements of the set {o(a) | 0 € 
G}. Then mo(x) = (4 — a1) ... (© — Qn) and also mg(z) = a” + 
te aie with a; = (—1)'s,_; where {s1, ..., 5,} are the elementary 
symmetric functions in {a1, ..., Qn}. 
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Let 69 = 1 and fori=1,...,n-—1, let 6; be the ith elementary 
symmetric function in {a2, ..., @,}. Then for each i =, ..., 


T(aPi-1) = kis; for some integer k; 4 0. 


Thus if T(a{;-1) = 0 for each ¢ = 1,...,n, then s; = O for 
each i = 1,..., n, in which case mg(z) = x” (and then n = 1 and 
i,t) =o) and.w=0: 


Lemma 5.1.17. Let F be an algebraic number field and let a be an 
arbitrary element of F. Then there is a rational integer N such that 
Na is an integer in F. 


Proof. Consider ma(x) € Q[z]. We may “clear denominators” by 
multiplying m_(xz) by a nonzero integer N so that Nma(x) € Z[z]. 
Then 


n-1 
Nm,(2) = Na” + > bir’ with b; = Na; € Z for each i, 
i=0 


and, further multiplying by N”~!, we have 


n—-1 
N™mo(«) = Na” + ST b;N"- 1a" 

i=0 
n-1 

= N yr 4 ys b,N"—1-inigi 
i=0 
n-1 

= (Na)"+ > ci(Na)’ with c;=b;N"-'~* € Z for each i. 
i=0 


Thus if we let 8 = Na, mg(x) =x" + UP) G2" and we see that £ is 
an algebraic integer. 


Theorem 5.1.18. Let F be an algebraic number field with (F/Q) = 
m. Then Op is a free Z-module of rank m. 


Proof. F is a finite separable extension of Q so is obtained from Q 
by adjoining roots of a separable polynomial p(a). Let E > F bea 
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splitting field of p(x). Then E is a finite Galois extension of Q. Let 


n = (E/Q). 

Then E has a vector space basis {a1, ..., @,} over Q, and then, 
by Lemma 5.1.17, a vector space basis {(1, ..., 8n} with 8; € Og for 
each 7. Now consider the linear transformation U: E + Q” given by 

T (ap) 
U(a)= : 
T (an) 


This is a linear transformation between two Q-vector spaces, both 
of the same finite dimension n, and by Lemma 5.1.16 U(a) = 0 
implies a = 0, ie., U is 1 — 1. Then U must be an isomorphism. 
Thus for each i = 1, ..., , there is a unique element y; € E with 
U(%) = e, the vector in Q” whose ith entry is 1 and whose other 
entries are all 0, ie., 7(y%j8;) = 1 if j =7 and 0 if j 4 7. Observe also 
that {71, .--, Ym} is a basis for E over Q as {U(r1), ..., U(rn)} isa 
basis for {E”}. 

Now let a € Og be arbitrary. Then we may write a = UG 
with c; € Q for each 7. Since 8; € Og for each j, and Og is a ring, 
we have that a3; € Og for each 7, and then 


T(aBj) = S>T(ci7iB;) = S- iT (7:85) =. 
i=1 i=1 


But if a6; is an element of E of degree d, with mag, (x) = ie” Ae 
ag_1z* 1 +--+ with ag_1 € Z, then T(aB;) = (n/d)(—ag-1) € Z. 
Thus c; € Z for each j. Hence if we let C' be the Z-module generated 
by {y1, ---, %n} and B be the Z-module generated by {(1, ..., Bn} 
we have that 


BCOrCC. 


Of course, B and C (and Og) are torsion-free since E is a field of char- 
acteristic 0. Thus, B and C are each free of rank n, so by Corollary 
2.6.33 Og is free of rank n as well. 

Now Og = FM Og, so Of is a Z-submodule of a free Z-module 
of rank n, so is free as well, of some rank m’. But then m’ > m, as 
otherwise a basis of Og could not span F, and m > m’, as otherwise 
a basis of Og could not be linearly independent, so m’ = m. 
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Remark 5.1.19. Of course, Theorem 5.1.18 gives us the additive 
structure of Og, but says nothing about the multiplicative structure 
of Og, which depends on the particular field F (not just on the degree 
of F). 0 


5.2 Dedekind rings 


In order to define Dedekind rings we will have to first think more 
about integrality. 


Definition 5.2.1. Let R be a subring of the field F. An element a 
of F is R-integral if there is a monic polynomial f(x) € R[x] with 
f(a) = 0. A subring A of F is R-integral if every element a of A is 
R-integral. © 


Example 5.2.2. If R = Z, and F is a finite extension of Z, then a 
is R-integral if and only if a is an algebraic integer. 
Also, the ring of algebraic integers Op is R-integral. © 


Example 5.2.3. Again let R = Z and consider F = Q(VD), with D 
as in Example 5.1.13. Let n be any integer, n 4 0,+1, let E = n?D, 
and consider 


{a+bVE | a,beZ} D = 2,3 (mod 4) 
7 {tivE | a,b eZ, a=b (mod 2)} D=1 (mod 4). 
Observe that A is a subring of F, that A is Z-integral, and that 


the quotient field of A is F. 
Also, let D = 1 (mod 4) and consider 


A={a+bvD | a,b€ Z}. 


Again observe that A is a subring of F, that A is Z-integral, and 
that the quotient field of A is F. © 


The point of this example is that we may obtain the same field 
F as the quotient field of different subrings of F. But here there is 
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a “best” choice of subring, the ring of algebraic integers Or, and we 
have a criterion to decide what is “best.” 


Definition 5.2.4. Let F be a field and let R be a subring of F. A 
subring A of F is R-integrally closed if every element a of F that is 
integral over R is an element of A. © 


Example 5.2.5. The ring of algebraic integers Op of F is Z- 
integrally closed but the subrings A of Q(VD) in Example 5.2.3 
are not. 0) 


However, the situation is more complicated. We would like to look 
at a subring A and have a criterion that only depends on A and not 
on some other subring R. Here it is: 


Definition 5.2.6. Let A be an integral domain and let F be its 
quotient field. Then A is integrally closed in F if every a € F that is 
A-integral is an element of A. © 


The distinction between A being A-integrally closed and Z- 
integrally closed is a subtle one. We will need the notion of A- 
integrally closed to develop the theory of Dedekind rings, which we 
will define abstractly. But, as we shall see, if F is an algebraic number 
field, these notions coincide: Of is not only Z-integrally closed (that 
was its definition) but also Og-integrally closed. 

Now, we come to our main definition. 


Definition 5.2.7. An integral domain A is a Dedekind ring (or 
Dedekind domain) if: 


(a) A is Noetherian; 
(b) A is integrally closed in its quotient field; and 
(c) Every nonzero prime ideal of A is maximal. o) 


In fact, we have already seen many Dedekind rings. 
Theorem 5.2.8. Let A be a PID. Then A is a Dedekind ring. 


Proof. Let A be a PID. From Theorem 3.6.15, we know that A is 
Neotherian, and from Theorem 3.12.7, we know that every nonzero 
prime ideal is maximal. So it remains to show that A is integrally 
closed in its quotient field. Let a be an element of this quotient field 
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that is integral over A. Then a is a root of a polynomial 
f(z) = 2" +an—1e" * + +++ +a9 € Ala]. 


Write a = B/y with 8,7 relatively prime elements of A. (We can 
do so because A is a PID.) Then 


(B/7)" + an—1(8/7)"™* +++ +49 =0 
so, multiplying by y”, 
B" + an1B" y+ +++ +agy” =0, 
B” = y(—an_18" + — +++ — apy") 


and so we see that 6” is divisible by 7. Now, 6 and y are assumed to 
be relatively prime, so 6” and 7¥ are relatively prime as well. Hence, 
7 must be a unit and a= By! € A. 


We record the following observation. 


Lemma 5.2.9. Let A be a Dedekind ring. Then A is a PID if and 
only if A is a UFD. 


Proof. This is a special case of Theorem 3.12.10. 


Here is our next main goal. 


Theorem 5.2.10. Let F be an algebraic number field and let Op be 
the ring of algebraic integers of F. Then Op is a Dedekind ring. 


Proof. Let A = Of. We must show that A satisfies the three prop- 
erties of a Dedekind ring. 


(a) A is Noetherian: Let I; C Ig C I3 C ... be a sequence of ideals 
of A. We must show this sequence is eventually constant. To 
this end, let J = Ujs1J;. Now each J; is a Z-module, and so 
is their union J. Now I C Og, which, by Theorem 5.1.18, is a 
free Z-module of finite rank. Then I itself is a free Z-module 
of finite rank, by Corollary 2.6.33. (In fact, if J # 0, the rank 
of I is equal to the rank of Op, but we do not need this fact.) 


Let {a1, ..., @} generate I as a Z-module. Then they certainly 
generate J as an ideal of A. But for each 7, a; € I, for some k;. If 
we let k = max(ky, ..., kn), then a; € I, for everyi=1,..., n, 


so I = I, and hence Ip = Inia = Inte 


384 


(b) 
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A is integrally closed in its quotient field: Let a € F and suppose 
that a@ is integral over A. We must show that a € A. To this 
end, let a be a root of a monic polynomial p(x) € A[z]. We will 
show that a is in fact a root of a monic polynomial q(x) € Z[z], 
in which case a € A. 

Let E be a finite Galois extension of Q containing F. (We 
found such an extension in the proof of Theorem 5.1.18.) Let 
G = Gal(E/Q) and let 


ae) = T] o(p@)). 


o€G 


Then a is a root of q(x). Also, g(x) is a product of monic poly- 
nomials, so is monic. 

Now q(2) is invariant under the action of G, so q(x) € Q{z}, 
ie., every coefficient of g(x) is in Q. 

Furthermore, every coefficient of g(x) is a sum of products of 
coefficients of each polynomial o(p(z)), all of which are algebraic 
integers. (If 6 € E is an algebraic integer, then so is 0() for any 
o € G, as GB and o(8) have the same minimal polynomial.) But 
Of is a ring, by Theorem 5.1.12, so every coefficient of q(x) is an 
element of Or. Thus every coefficient of q(x) is in QM Of. But, 
by Lemma 5.1.11, QM Og = Og, and then, by Corollary 5.1.5, 
Oo = Z. 

Every nonzero prime ideal of A is maximal: In proving this, we 
will be repeatedly applying Theorem 3.12.8: Let R be a commu- 
tative ring with 1. Then an ideal J of R is prime (resp. maximal) 
if and only if the quotient R/J is an integral domain (resp. a 
field). 

Let P be a nonzero prime ideal of A. Let a € P, a € 0. Consider 
its minimum polynomial ma(x) € Z[z], 


Mo(x) = 2" + oe a oe ee 
Note that ag 4 0 as a £0 and m,(z) is irreducible. Then 
0= mela) = a” + an_10" 1 +--+ +49 


SO 


= sO" Se qe ae" = a) = P 
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Hence, Q = PNZ F {0}. Let i: Z > A be the inclusion and 
m: A — A/P be the projection. Then Ker(zi) = ZN P = Q, 
so Im(mi) = Z/Q C A/P. But P is a prime ideal of A so A/P 
is an integral domain and hence Z/Q is an integral domain. (If 
A/P has no zero divisors then certainly any subring of A/P has 
no zero divisors.) Since Z/Q is an integral domain, Q is a prime 
ideal in Z. But in Z, prime ideals are maximal, so Q is a maximal 
ideal and hence Z/Q is a field. 

Now A is integral over Z so A/P is integral over Z/Q (by looking 
at representatives). Also, A is finitely generated over Z so A/P 
is finitely generated over Z/Q (by the images of generators of A 
over Z). Thus, A/P is an integral domain that is a finite dimen- 
sional vector space over the field Z/Q, so by Lemma 4.3.8 A/P 
is a field. Hence, P is a maximal ideal of A. 


Corollary 5.2.11. Let F be an algebraic number field and let Or be 
the ring of algebraic integers of F. Then Og is a PID if and only if 
Op is a UFD. 


Proof. Immediate from Theorem 5.2.10 and Lemma 5.2.9. 


5.3 Ideals in Dedekind rings 


In this section, we reach our goal of establishing unique factorization 
of ideals in Dedekind rings. 

We first recall that if R is any commutative ring with 1, and I 
and J are ideals of R, their product IJ is the ideal of R generated by 
{a8 |ae€TI,6 € J}; more concretely, IJ = {finite sums Ha; 3; | a; € 
I, 6; € J}. 

We begin with a more general result. 


Lemma 5.3.1. Let R be a Noetherian integral domain. Then every 
nonzero ideal I of R contains a product of nonzero prime ideals. 


Proof. Suppose that R contains a nonzero ideal J that does not 
contain a product of nonzero prime ideals. Set 4 = J. Either J, is 
maximal among ideals with this property or it is not. If it is not, let 
I, C Ig where Ig has this property. Either J> is maximal among ideals 
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with this property or it is not. If it is not, let Ig C [3 where J3 has this 
property. In this way we obtain a sequence of ideals I; C Ig C.... 
But FR is Noetherian, so this sequence stops at some finite stage J,. 
Set J = J. Thus, J does not contain a product of nonzero prime 
ideals but any ideal J’ with J Cc J’ does. Now J is not a prime ideal 
(as then J would contain the prime ideal J). Thus, there are elements 
a and 8 of R witha ¢ J, 8 € J, but af € J. Then Jj, the ideal 
generated by J and a, properly contains J, so must contain a product 
of prime ideals, and J}, the ideal generated by J and £, properly 
contains J, so must contain a product of prime ideals. But then J} J} 
contains a product of prime ideals. But Jj J = (J+aR)(J+8R) = J 
as a3 € J; contradiction. 


Definition 5.3.2. Let R be an integral domain and let F be its 
quotient field. A subset I of F is a fractional ideal if 


(1) i+9 € I wheneveriel, jel 
(2) ri € I whenever re R,ie I 
(3) There is some nonzero element d of F such that dI C R. 


An element d as in (3) is called a denominator of I. ?) 


Remark 5.3.3. We see that fractional ideals are generalizations of 
ideals, as a fractional ideal of R is an ideal of R if (and only if) we 
can choose its denominator d = 1. 


Example 5.3.4. Let R be an integral domain with quotient field 
F. Let f ¢ F, f 4 0. Write f = c/d with c,d € R, and let J = 
{rf | r € R}. Then J is a (principal) fractional ideal of R with 
denominator d. ) 


Definition 5.3.5. A fractional ideal J of an integral domain R is 
invertible if there is a fractional ideal J of R with IJ = R. In this 
case we write J = I~! (and I= J+). .) 


Lemma 5.3.6. If I is invertible, then I~' is unique. 


Proof. If J; and Jo are both inverses of J, then 


Jp = {R= J(IJo) = (HD) Jg = RJp = Jo. 
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Example 5.3.7. Let R be an integral domain with quotient field F. 
Let J be a nonzero principal fractional ideal generated by f € F. Then 
I~' is the nonzero principal fractional ideal generated by f7!. © 


Lemma 5.3.8. Let A be a Dedekind ring with quotient field F. Let 
P be a nonzero prime ideal of A. Set 


N={8EF| BPC A}. 
Then: (a) N is a fractional ideal of A. 
(b) ACN. 
(c) PN = A, and hence N = P7. 
In particular, every nonzero prime ideal of A is invertible. 


Proof. (a) It is easy to check that properties (1) and (2) of a 
fractional ideal hold for N. Let a be any nonzero element of P. By 
the definition of N, for every 6 € N, Ga € P C A. Thus, a isa 
denominator of N. 

(b) Clearly A C N so we must show A # N. Choose a € P, 
a #0, and let I be the principal ideal generated by a. By Lemma 
5.3.1, J contains a product P, ... P, of prime ideals of A. We may 
suppose that J does not contain a product of fewer than n of these 
factors. Since J C P, P contains the product P, ... P, and hence P 
contains P; for some i. (Otherwise, choose y; € P;, y; ¢ P for each i. 
Then 71, .-., Yn € P, which is impossible, as P is a prime ideal.) We 
may suppose that P contains P;. But then P = P, as prime ideals 
of A are maximal. 

Ifn = 1, set J=A.Ifn > 1, set J = Py... P,. Then I does 
not contain J. Let y be any element of J with 7 not an element of 
I. Then, recalling that P = P,, 


Py CPJ =P\Ps .. Py GT = Ao, 


so P(ya~') C A. Thus if we set 6 = ya~!, then from the definition 
of N we see that 6 € N. (Here we take a! € F.) But 6 ¢ A, as if 
B€A then y = Ba € Aa = I; contradiction. 

(c) By the definition of N we have that PN C A. Since P C A, 
we have that 1 € N so P C PN. Now P is a prime ideal of A, hence 
a maximal ideal of A, so we must have PN = P or PN = A. We 
will show that PN = P is impossible. Then PN = A as claimed. 
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Suppose that PN = P. Let d be a denominator of the fractional 
ideal N. For any 6 in N, we have P6 C PN = P. But then 


P6? = (PB)\B C(PN)B =PBCPN=P 


and by induction we have that P* C P for every positive integer k, 
and hence 6* € N for every positive integer k, so d3" € A for every 
positive integer k. Thus if we let M be the subset of F generated by 
A and {6* | k = 1,2,...}, then M has denominator d and so is a 
fractional ideal of A. 

Now dM is an ideal of A, and A is Noetherian, so dM is 
finitely generated as an ideal of A, and hence M is finitely gener- 
ated as a fractional ideal of A. (If {71, ..., y%} generates dM, then 
{d-'y, ..., d~!yz} generates M.) 

In the special case that A = Op, we have seen that A is finitely 
generated as a Z-module, and hence M is finitely generated as a 
Z-module, and so § is an element of A. 

For A a general Dedekind ring, let {6),..., 6,} generate M as 
a fractional ideal of A. Then, by the definition of M, 6; = p;(@) 
for some polynomial p;(x) € A[z], for each 1 = 1, ..., &. Choose 
n > max(degree(p1(x)), ..., degree(pz(y))). Then 6" € M, so B" = 
D*_, aip:(8) for some elements a1, ..., a, of A. But then, if 


k 
f(z) =a" — 5° oupi(z), 
4=1 


f(x) isa monic polynomial in A[z] with f(3) = 0. Hence, @ is integral 

over A. But A is integrally closed in F, so ( is an element of A. 
Now ( was an arbitrary element of NV, so we have N = A. But 

this contradicts (b). 


Here is our main result. 


Theorem 5.3.9. Let A be a Dedekind ring with quotient field F. 
Then any nonzero fractional ideal I of A can be expressed as a product 


=P ea 
for some mutually distinct prime ideals Py, ..., Py of A and nonzero 
integers €1,..., €k, and this expression is unique up to the order of 


the factors. (Here we regard A as the empty product.) 
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Proof. First we prove the existence of the factorization of J as a 
product of prime ideals and then we prove uniqueness. 

We begin with the case that J is an ideal of A, and we will show 
in this case that each of the exponents is positive. 

Suppose there is some ideal J; that cannot be expressed in this 
way. If J; is maximal among ideals of A with this property, stop. 
Otherwise J; C Iz where Ig has this property. If [2 is maximal among 
ideals of A with this property, stop. Otherwise Jz C I3 where [3 has 
this property. Keep going. 

In this way we obtain a sequence of ideals 1; C Ig C Ig C.... 
But A is a Noetherian ring, so this sequence must stop at some Jy. 
Set J = I,. Then J cannot be expressed as a product of prime ideals, 
but every ideal J properly containing J can be. 

Now I # A, as A is the empty product. Since A is Noetherian, 
by a similar sort of argument J is contained in a maximal ideal P,. 
Since A is a Dedekind ring, P; is a prime ideal, and then by Lemma 
5.3.8 P, is invertible. 

Since I C P,, IP(' C P,P, + =A. Also, AC P>+ so T=IAC 
‘a Ps We claim that in fact IC IP, | This follows by the same 
argument as in the proof of part (c) of Lemma 5.3.8: If J = IP, * 
then for every 6 € P, 1 BI CI for every positive integer k, and 
looking at. the fractional ideal generated by I and {6* | k = 1,2, ...} 
we see that ( is integral over A so 8 € A; contradiction. 

Thus, = 2P, ! is an ideal of A properly containing I, so J 
can be expressed as as product of prime ideals J = P) ... P,. Then 
I=P,P,... P,. This product may have some repeated factors; if so 
we can group them together and renumber to obtain an expression 
for J as in the statement of the theorem. Note that in this case each 
of the exponents is positive. 

Now let J be a fractional ideal of A. Let d be a denominator for 
I. Then I’ = I(Ad) is an ideal of A, so I’= Pf’... aie Also, Ad is 


an ideal of A, so Ad = Pe ... PE*. But then 


Let AQ a P pa ee 


jt+l 


(where if there is any duplication in the prime ideal factors of I’ and 
Ad, we combine terms.) 
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Now for uniqueness. Suppose that we have a fractional ideal I of 
A with I expressed as a product of prime ideals in two ways: 


T=Pe 3, PO =Or .. OH 


Multiplying both sides by suitable powers of these prime ideals and 
renumbering as necessary, we may assume that we have an ideal J’ 
of A with e; > 0 for each i and f; > 0 for each j, and that the sets 
of prime ideals {P,, ..., Py} and {Q1, ..., Qe} are disjoint. Then 


Pi > P?... Pe#=Ql... Ql 


and each of these ideals is prime, so in particular P; D Q; for 
some j. (Otherwise, let g; € Q;, q; ¢ Pi for each j. Then qi. 
git € P,, which is impossible as P, is a prime ideal.) But Q;, being 
prime, is also maximal, and so P; = Q,; contradiction. 


Corollary 5.3.10. In the situation of Theorem 5.3.9, then I is an 
ideal of a Dedekind ring A if and only if e; > 0 for each i. 


Proof. Certainly if each e; > 0 then J is an ideal of A, and we 
observed the converse in the course of proving Theorem 5.3.9. 


Corollary 5.3.11. The set of nonzero fractional ideals of a Dedekind 
ring A forms an abelian group under multiplication of fractional 
ideals. 


Proof. Multiplication of fractional ideals is commutative and asso- 
ciative, and A is the identity. Also, every fractional ideal J has 
an inverse: If J = Py’,..., Pe as in Theorem 5.3.9, then J~! = 
Pit sine ges 


Theorem 5.3.12. The group of nonzero fractional ideals of a 
Dedekind ring A is a free abelian group with basis {nonzero prime 
ideals of A}. 


Proof. Referring back to Definition 2.6.22 and Definition 2.6.24 
of what it means for an abelian group G to be free with a basis 
B, we see that this immediately follows from Theorem 5.3.9. (Note 
here that we are writing this group multiplicatively rather than 
additively.) 
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Theorem 5.3.13. Let F be an algebraic number field and let A = Or 
be the ring of algebraic integers of F. Then Theorem 5.8.9, Corollary 
5.3.10, Corollary 5.3.11, and Theorem 5.3.12 hold for A. 


Proof. By Theorem 5.2.10, A = Of is a Dedekind ring. 


5.4 Examples 


In this section, we do a bunch of examples. We do both fami- 
lies of examples and individual examples, and at the end we do a 
“nonexample” . 

Here will be our general set-up throughout. We will let F = 
Q(VD) be a quadratic extension of Q, and let A = O(VD), the 
ring of algebraic integers of F. Then G = Gal(F/Q) is a group of 
order 2, with the nontrivial element of G being o: F > F defined by 
a(VD) = —VD. If I is an ideal of A, we let I = o(1). 

We will denote the ideal J generated by elements aj, ..., a, of A 
by <a1,..., @ >. Often parentheses are used for this, but we will 
be using parentheses to group elements for multiplication, as usual, 
so we choose this notation to avoid ambiguity. 

We will begin by considering the case D < 0, and afterwards 
consider the case D > 0. 


Lemma 5.4.1. Let D < 0 and suppose |D| is a squarefree composi- 
tive positive integer |D| =p, ... py with k > 2. Then 


|D| = (p1)(p2) --- (pe) = —(VD)? 


are two factorizations of |D| into irreducibles. None of these irre- 
ducibles are prime. 

If I; =< p3,V/D >, i = 1,..., k, then each I; is a prime ideal, 
with I? =< p; >, a principal ideal. Furthermore I; = I;. 

Then, we have factorizations 


<D>=f?...I2 and <VD>=1,... I. 


Proof. Recall that we have a multiplicative norm on A given by 
N(a+bVD) = |\a? — b?D| and it is easy to check from Lemma 3.9.1 
that each of p;,..., pk and VD is irreducible. The factorization of 
|D| given in the statement of the lemma shows that none of these 
are prime. 
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Let [=<p, VD > for any p dividing D. Clearly [ = I. We observe 
that I is a proper ideal of A, as N(q) is divisible by p for every a € I. 
We now show that J is a maximal ideal of A, and hence prime. 

To this end, let I’ > I and let a € I’, a ¢ I. Write a=a+bVD. 
If D = 1 (mod 4) replace a by 2a if necessary, so that a,b € Z. Since 
I’ is an ideal, 8 = a+ (—bVD) = a € I’, and since I is an ideal, 
B € 1. Thus a € Z is not divisible by p, and so, since p is a prime in 
Z, there are integers x and y with ax + py = 1. Thus 1 € J’ and so 
rea, 

Now 


P=<p,VD><p,VD >= <p’,pvD,pvD, D > 
= <p’,pVD,p(D/p) > C <p>. 


But p and D/p are relatively prime so there are integers x and y 
with px + (D/p)y = 1 and so p € I*. Thus, I? = <p>. Then, we 
immediately see that <D> = <p|>...<pp> =I? ... i 

More subtly, let us consider 


J=h...Ik=<p1,VD> <p2,VD> ... <py,VD >. 


Set q; = D/p;, for convenience. 

Generators of the product ideal J are given by multiplying gener- 
ators of each of the factors I;. If we choose any generator to be VD, 
then this product is divisible by VD, so is an element of < /D >. 
The only other choice is p, ... py = —D, which is also divisible by 
VD, so is also an element of < /D >. Thus JC < /D>. 

On the other hand, we can choose VD from the first factor and 
p2,..., Pk from the remaining factors, so qi/D € J. Similarly, 
qVD € J for each i. But {q, ..., q} is relatively prime, so, as 
above, /D € J andso < VD > C J. Thus, J=<VJVD>. 


Lemma 5.4.2. Let D < 0 and suppose |D| = p a prime with p=1 
(mod 4). Then 


p+1=2((p + 1)/2) = (1+ VD)(1- VD) 


are two factorizations of p+ 1 into irreducibles. None of these irre- 
ducibles are prime. 
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If ly = <2,1+VD > and lh = < (p+ 1)/2,1+ VD >, then 
and Iz are nonprincipal prime ideals, with I? = <2> and If = < 
(p+1)/2 >, both principal ideals. Furthermore, 1, = I, but In 4 In. 

Then we have factorizations 


<pt+1>=Fhh and <1+VD>=hh, 
<1-W7Ds=ib. 


Proof. For simplicity, set r = (p + 1)/2 and note that r is an odd 
integer with r > 1. 

Again each of 2, r, and 1+ VD is irreducible by Lemma 3.9.1, 
but not prime. 

Note that D = —p = 3 (mod 4). Thus A = {a+ bVD | a,b € Z}. 
Clearly i = J, but Ih # Io, as if In = Ig, then r € In, 2 = (1 + 
VD) +(1—VD) € In, and then Iz = A. But I, (resp. Iz) is a proper 
ideal of A, as N(q) is divisible by 2 (resp. r) for every a € I; (resp. 
ae In). 

J, and Ig are both maximal, and hence prime, ideals by an argu- 
ment similar to the proof of Lemma 5.4.1. It is easy to check that 
they are not principal. 

Now 


B=hh=<21+VD><2,1-VD> 
=< 4, 01 4/D) 20 +4/D) 1 4/ Dl WD) > 
=<4,2(114+V7D),1—D>=<4,2(14+ VD), 2r > 


= <2> 


as r is odd. 
Also, 


Inlg = <r,14+VD><17,1-VD> 
=<r',r(1—VD),r(1+ VD),2r >=<r> 


again as r is odd. 
Thus, 


<ptlse—<2sers=F hi. 
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More subtly, let us consider 
Ib =<2,14VD><r1+VD> 
= < 2r,2(1+ VD),r(1+ VD), (1+ VD)(1+ VD) > 
=< (1+VD)(1— VD),2(1+ VD), 
r(l+VD),(1+VD)(1+ VD) > 
=<1+VD> 


once again as r is odd, and similarly 5 =<1—WJVD >. 


Remark 5.4.3. We saw in Chapter 3 that for D = —1,—2,—3, 
A= O(VD) is a Euclidean domain and hence a PID. So this leaves 
the cases D = —p for p = 3 (mod 4), p > 3, open. If p = 3 (mod 
4) then p = 3 or 7 (mod 8). It can be shown by arguments similar 
to that in Chapter 3 that for D = —7, A = O(VD) is a Euclidean 
domain, and hence a PID. It can also be shown by arguments similar 
to that of Lemma 5.4.2 that if D = —p, for p = 7 (mod 8), p > 7, 
then A = O(VD) is not a PID. 
This takes care of all negative values of D except D = —p 
for p a prime, p = 3 (mod 8). It can be shown that for D = 
11, —19, —43, —67, —163, A = O(VD) is a PID. 
Assembling these results, we see that A = O(V/D) is a 
PID for the following nine values of D: D = —1,-—2,-—3,—7,-11, 
19, —43, —67,—163. This was known to Gauss. Gauss also con- 
jectured that these are the only negative values of D for which 
A = O(VD) is a PID. This turns out to be true. That is a 
very deep fact, and one of the great theorems of 20th century 
mathematics. v) 


Example 5.4.4. (a) Let D = —5 and let us reconsider the factor- 
izations in Example 3.9.5(a). We had 


6=2-3= (1+ V=8)(1- v=B), 
9=3-3 = (24+ V—5)(2— V—5). 
The first of these is a special case of Lemma 5.4.2, so we concentrate 


on the second. - 
Let J =< 3,1+ /—5 > so that J=< 3,1-—/V-5 >. 
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You may check that these are both maximal, and hence prime, 
ideals. Then 


l= <3,1+V-5 ><3,1-V-5> 
= 29.301 4-4/5), 31 —4/=5),6 SSe 35 
sO we see 
29SS<23 543 >= U1 i1). 
We also compute 
P=<3,1+V—-5 ><3,14+V—-5> 
=< 9,3(1 44/—5),3(1 4-4/9), 1 5S 
= < 9,3(1+ V—5),-4+2V/-5 > 
= < (2— V—5)(2 + V—5), (2 — v—5) 
(—1+ v—5), (2 — V—5)(—2) > 
= 29 —4/-5 > 


and similarly 


P=<2+V-5> 
sO we see 
205220475 so 9-7-5 S770) 


and so both expressions give the same factorization of the ideal < 9 > 
into a product of prime ideals. 

Note that I and J are not principal ideals, but JJ, [?, and I? are 
principal ideals. 


(b) Let D = —6 and let us reconsider the factorizations in Exam- 
ple 3.9.5 (b). We had 
622.8 = —1(/—6F, 


10=2-5= (2+ V—-6)(2— V-6). 


The first of these is a special case of Lemma 5.4.1, so we concentrate 
on the second. 


396 An Introduction to Abstract Algebra: Sets, Groups, Rings, and Fields 


Let I; = < 2,,/—6 > sothat 4, = hh, and let 5b =< 5,2+-/—6 > 
so that Ip =< 5,2—J/-6>. 

You may check that these are both maximal, and hence prime, 
ideals. Then 


=<) /—6 S<2)4/—6 SSe 4,9 —6,2/ —6,-6 > =< 2S 


and 


Ioln = < 5,2 4+/—6 >< 5,2 —/—6 > 
= < 25,5(2 + /—6), 5(2 — /—6), 10> =<5> 


sO we see 
<10 >=<2><5>=7bh. 
We also compute 


lilo = < 2,./-6 >< 5,2++/-6 > 
=< 10,4-+4 2/6, 5V/—6, -—6 + 2-6 > 
= < 10,44 2V/-6, 5V—6, -10 > 
= < 10,44 2-6, 5V—6 > 
= < 10,44 2V-6, 5/—6, 10 + 5-6 > 
= < (2+ V—6)(2— V—6), (2+ V—6)(2), 

(2+ /—6)(5)) > 

= <2+/-6> 


and similarly 
hh=<2-VJ-6> 
sO we see 
<10>=<24+VV-6><2-V-6> = (hh)(hh) 


and so both expressions give the same factorization of the ideal 
< 10 > into a product of prime ideals. 
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We further compute 
B= <5,2+/-6><5,2+V-6> 
= < 25,5(2 + /—6), 5(2 + /—6), -2+ 4-6 > 
= < 25,5(2 + /—6),-2+4/-6 > 
= < (-1+ 2/—6)(—1— 2vV-6), (-1 + 2V-6) 
(2 — V—6), (-1 + 2V—6) (2) > 
=< =140/-6 > 


and similarly 


B=<-1-2/-6>. 


We note that I (3, da, and Jy are not principal ideals but 
P Ih, Th, 3, and ie are principal ideals. 
(c) Let D = —26 and observe that 


7= 3" =O +i) = 7-26). 


You may check that these are two distinct factorizations of 27 into 
a product of irreducibles. 

Let I =< 3,1+.+/—26 > so that I =< 3,1 —./—26 >. You may 
check that these are both prime ideals. Then 


IT = <3,1+V—26 >< 3,1-— /—26 > 
= < 9,3(1 — /—26), 3(1 + /—26), 27 > 
= < 9,6,3(1 + V—26),27 >=<3> 


SO we see 
ao S=(s): 
We also compute 
P= <3,1+ V—% >< 3,14 V—26> 
= < 9,3(1 + V—26), 3(1 + V—26), —25 + 2\/—26 > 
= < 9,3(1 + /—26), —25 + 2\/—26 > 
= < 9,3(1 + V—26), 2(1 + V—26) > 


=<9,1+/-26> 
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and further that 
PaPrrs<0,144/—96 s< 3, 14+-+/—26 
= < 27, 9(1 + /—26), 3(1 + /—26), -25 + 2\/—26 > 
= < 27,3(1 + /—26), —25 + 2/—26 > 
= < 27,3(1 + V—26), 2(1 + /—26) > 
= < 27,1+ V/—26 > 
= < (1+ /—26)(1 — /—26),1 + /—-26 >=< 14+ 7-26 > 


and similarly J? =< 9,1 — ./—26 > and =< 1 — ./—26 > so we 
see 


297 S=(F (FP) 
and so both expressions give the same factorization of the ideal 


< 27> into a product of prime ideals. 


Note that I, I?, I, I? are not principal ideals but IJ, I?, and I? 
are principal ideals. © 


Example 5.4.5. Let D = 6 and consider the factorizations 
6 = (2)(3) = (VO), 


At first glance this appears to be an example of two distinct fac- 
torizations of 6 into a product of irreducibles, just as in Example 
5.4.4 (b), but in this case appearances are deceiving. It is not. 

We have that 


2=—(2+ V6)(2 — v6), 
3 = (3+ V6)(3 — V6) 


so that 


= ~(V8)(v6) 


so these are the same factorization. 
In fact, O(\/6) is known to be a PID. » 
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On the other hand, we have the following class of examples, which 
generalizes Example 3.9.5(c). 


Lemma 5.4.6. Let D = 2p where p is a prime with p= 5 (mod 8). 
Then 


D = (2)(p) = (VD) 


are two factorizations of D into irreducibles. None of these irre- 
ducibles are prime. 

If I, = < 2,V/D > and Ip =< p,VD > then I, and Iy are both 
prime ideals, with I? =< 2> and I3 =< p>, both principal ideals. 
Furthermore, I, = qh, Ip = In. Then we have factorizations: 

<D>=I?I3 and <VD>=hh. 
Proof. We have a multiplicative norm on A given by N(a+bVD) = 
|a2—b? D|. Then N(2) = 4, N(p) = p?, and N(D) = 2p. We will show 
that A does not have an element of norm 2, and also does not have 
an element of norm p, and then it follows from Lemma 3.9.1 that 2, 
p, and VD are all irreducible. 

Suppose a € A with N(a) = 2. Writing a = a+ bVD with 
a,b€Z, 

Ja? — b?(2p)| = 2 


a” — 2pb? = +2 


and reducing mod p, 
a” = +2 (mod p) 
which has no solution by Corollary 2.7.15 and Corollary 2.7.16. 


Suppose a € A with N(a) = p. Writing a = a+ bVD with 
a,b€Z, 


Ja? — b?(2p)| =p 
= =p + b? (2p) = p(2b? ale 1) 
Hence, a must be divisible by p, in which case 2b? + 1 is divisible by 
Dp, 1.€., 


2b? + 1 = 0 (mod p) 


which again has no solution by Corollary 2.7.15 and Corollary 2.7.16. 
The rest of the proof of this lemma is identical to the proof of 
Lemma 5.4.1. 
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Remark 5.4.7. If A = O(VD), then A has norm N(a+bVD) = 
|a2 — b?D|. Note that if D < 0, —D > 0, so it is easy to determine 
whether A has an element of any given norm n: There are only finitely 
many possibilities for a and 6, as we must have |a| < \/n and |b| < 
/n/D. 

On the other hand, if D > 0, —D < 0 and it is a much more 
subtle question to determine whether A has an element of norm n: 
There are infinitely many possibilities for a and b. (Compare the case 
D = 6, where in Example 5.4.6 we saw that A has an element of norm 
2, with the case D = 10, where as a special case of Lemma 5.4.6 A 
does not have an element of norm 2.) Compare also the generality of 
Lemma 5.4.1 and Lemma 5.4.2, both for D < 0, with the much more 
specific case of Lemma 5.4.6, for D > 0, and note we had to use some 
specific number-theoretical information to prove Lemma 5.4.6. 

In fact, much more is known about O(VD) for D < 0 than is 
known for D > 0. Gauss conjectured that there are infinitely many 
values of D > 0 for which O(/D) is a PID (though he did not state 
the conjecture in this language) and this conjecture is still completely 
open. (Contrast this with Remark 5.4.3.) o) 


We conclude with an example that illustrates what can go wrong 
when we are dealing with a ring that is not a Dedekind ring. 


Example 5.4.8. Let R = Z|,/—3] = {a+ b/—3 | a,b € Z}. Then R 
has quotient field F = Q(./—3), but R 4 Of and R is not a Dedekind 
ring. We have the factorizations 


4 = (2)(2) = (1+ V—-3)(1 — v-3) 


and you may check that these are two distinct factorizations of 4 into 
a product of irreducibles in R. 

You may also check that J = < 2,1+ /—3 >=< 2,1- /—3 > 
is a maximal, and hence prime, ideal, and furthermore that J is the 
unique prime ideal with J D> < 2 >, I is the unique prime ideal 
with I > < 1+4/—3 5, and I is the unique prime ideal with 
fo.<l=w 3S 

You may further compute that I? =< 4,2(1 + /—3) >= 
< 4,2(1 — /—3) > and so 


1529550, Poeitqy-is5l, Pai S5F 
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Thus, the ideals < 2 >, < 1+ /—3 > and < 1— V—3 > do not 
have factorizations into a product of prime ideals. 

Also, note that I? =< 2 > I so we do not have “cancellation” in 
multiplying products of nonzero prime ideals, as we do in Dedekind 
rings. 

Finally, observe that for any positive integer k, [* =< 2*-! > 17, 
so that no power of J is a principal ideal. % 


5.5 Further developments 


In this section, we want to mention some further results in algebraic 
number theory that would take us beyond the bounds of this book 
to prove, but which we feel the reader should be aware of. 


Definition 5.5.1. Let A be a Dedekind ring and let G be the group 
of nonzero fractional ideals of A under multiplication. Let H be the 
subgroup of G that is the group of nonzero principal fractional ideals 
of A under multiplication. Then the ideal class group Cl(A) is the 
quotient Cl(A) = G/H. ©) 


Theorem 5.5.2. Let F be an algebraic number field, and let A = Of. 
Then Cl(A) is a finite group. 


Definition 5.5.3. Let F be an algebraic number field. Then the 
class number h(F) of F is h(F) = |Cl(Of)|, the order of the ideal 
class group of Of. © 


Remark 5.5.4. We see that h(F) = 1 if and only if Og is a PID. > 


Corollary 5.5.5. Let I be any ideal in Or. Then there is a positive 
integer k for which I* is a principal ideal. 


Proof. Any element of a finite group has finite order. 


Theorem 5.5.6. For any algebraic number field F, there is an effec- 
tive procedure for finding the group Cl(Op) (and hence for finding 


h(E)). 


Remark 5.5.7. In the special case when F is a quadratic exten- 
sion of Q, ie., F = Q(VD), there is a formula, due to Dirichlet, for 
h(E). 0 
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5.6 Exercises 


1. 


Let A be a Dedekind ring and let J and J be nonzero frac- 
tional ideals of A. Let J and J have prime factorizations J = 
Pe soap a of = Pi er. Be, (Here we allow some e; or 
fi to be 0 so that the same prime ideals appear in both fac- 
torizations.) Show that J C J if and only if f; > e; for each 
4=1,...,k. 


. Let A be a Dedekind ring and let J and J be nonzero fractional 


ideals of A with I” C J” for some positive integer n (resp. I” = 
J” for some nonzero integer n). Show that IJ C J (resp. I = J). 


. Let J and J be nonzero ideals of a Dedekind ring A. We say that 


I divides J if there is an ideal kK of A with J = JK. Show that 
I divides J if and only if J C J. 


. We say that two nonzero ideals J and J of a Dedekind ring A are 


relatively prime if there is no prime ideal P that appears in both 
of the factorizations of J and J into products of prime ideals. 


(a) Suppose that J and J are relatively prime. Show that J and 
J are coprime, i.e., that J+ J= A. 

(b) Suppose that J and J are relatively prime. Show that INJ = 
IJ. 


. Let J and J be nonzero ideals of a Dedekind ring A. We say that 


a nonzero ideal G of A is a greatest common divisor (gcd) of I 
and J if 


(i) G divides both J and J; and 
(ii) If K is any nonzero ideal that divides both I and J, then Kk 
divides G. 


We say that a nonzero ideal LE of A is a least common multiple 
(Icem) of J and J if 


(i) Both J and J divide L; and 
(ii) If & is any nonzero ideal that is divisible by both J and J, 
then L divides Kk. 


(a) Show that any two nonzero ideals J and J of A have a gcd 
G and an lcm JL, and express the prime factorizations of G 
and L in terms of the prime factorizations of J and J. 

(b) If J and J are relatively prime, show that A = gcd(J, J) and 
that iJ = lem(d,./), 


10. 


11. 


12. 


13. 
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(c) Let H be any nonzero ideal of A. Show that 


gcd(HI,HJ) =H gcd(J, J) and 
lem(HT,AJ) = A lem(T, J). 


. Let J and J be nonzero ideals of a Dedekind ring A. Show that 


A/I and J/IJ are isomorphic. 


. (a) Let A be a Dedekind ring and let J be a nonzero ideal of A. 


Show that there is a nonzero ideal J of A, with J and J relatively 
prime, such that the ideal J.J of A is principal. 

(b) Let kK be any nonzero ideal of A. Show that we may choose 
the ideal J in (a) to be relatively prime to K. 


. Let A be a Dedekind ring with quotient field F, and let J and J 


be fractional ideals of A. Show that there are elements a and 8 
of F such that aJ and £J are relatively prime ideals of A. 


. Let A be a Dedekind ring with only finitely many prime ideals. 


Show that A is a PID. 

Let A be a Dedekind ring and let P be a nonzero prime ideal of 
A. Let R be the localization of A at P. Show that R is a discrete 
valuation ring. 

Let A be a Dedekind ring and let J be a nonzero ideal of A. Let 
a #0 be an arbitrary nonzero element of J. Show that there is 
an element 6 of J such that J =< a, >. Thus, in particular, 
in a Dedekind ring every ideal can be generated by at most two 
elements. 

Let A be a Dedekind ring and let J be a nonzero ideal of A. 


(a) Show that A/J has only finitely many ideals. 
(b) Show that every ideal of A/T is principal. 


(Of course, if J is prime then A/T is a field. But if J is not prime, 
A/I is not an integral domain.) 

Let A be an integral domain in which every nonzero ideal factors 
uniquely as a product of prime ideals. Show that A is a Dedekind 
ring. 

(As a matter of historical fact, Dedekind proved that the ring 
of algebraic integers of any algebraic number field has unique 
factorization as in Theorem 5.3.9. Later, E. Noether abstracted 
the properties of these rings that make this work and defined a 
Dedekind ring as in Definition 5.2.7.) 
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Let S = {pi,..., px} be a set of district primes, and let D = 
—p1,°-: , Py. Then the subsets of S form a group G of order 2* 
under the operation of symmetric difference, which we denote by 
* Let Go be the subgroup Go = {¢, S'} of order 2. For any subset 
T of S, let dr be the product of the elements of T (with dr = 1 
if T = ¢) and let Ip be the ideal Ip =< dr, VD > of O(VD). 


(a) Let 7: G > Cl(O(VD)) be the map given by 7(T) = [Ir], 
where [I] is the equivalence class of Ip in Cl(O(VD)). Show 
that [I7,«7,| = [I7,]|17], so that 7 is a group homomorphism. 

(b) Show that Ir is a principal ideal if and only if T € Go. Hence, 
Ker(z) = Go. 

Thus, we see that Im(z) is a subgroup of Cl(O(VD)) iso- 
morphic to G/Go, a group isomorphic to (Z/2)*-!. 

(c) Suppose that D = 3 (mod 4). Let J = < 2,1+WD >. Show 
that J is not a principal ideal of O(VD), and also that JI 
is not a principal ideal of O(VD) for any T C S, while J? is 
a principal ideal of O(VD). 

Thus, we see that in this case Im(7) and [J] generate a 
subgroup of Cl(O(WD)) isomorphic to (Z/2)*. 

Thus, we conclude that if {p1,..., py} is a set of distinct 
primes, and D = —p,,..., pp, then h(O(VD)), the class number 
of O(VD), is divisible by 2*~!, and if D = 3 (mod 4), h(O(VD)) 
is divisible by 2". 

Note that this exercise generalizes the work we did in Lemma 

0.4.1. 

Let n be a positive integer and let g > 1 be an odd integer. Let 

a be a positive integer that is relatively prime to g. Let D be 

the unique squarefree integer that is defined by b?D = a? — q”, 

where 6 is a positive integer. Let I be the ideal 


I=<q,a+bVD> 
of O(VD). 


(a) If f =< q,a—bVD >, show that II =< q >, a principal 
ideal. 

(b) Show that I* =< q*,a+bWVD > for every positive integer 
ke 

(c) Show that I” =< a+bVD >, a principal ideal. 


16. 
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Conclude that there is a homomorphism ay: Zn, —> 
Cl(O(VD)) given by m;(k) = [I*]. Observe that Im(m) = 
Im(m;). Hence, Cl(O(VD)) has a subgroup Im(z,) isomor- 
phic to Z; for some integer j dividing n, and hence that 
h(O(VD)) is divisible by this integer j. 

(d) Show that j is the smallest positive integer such that q/ 
and a+ bVD have a common nonunit factor in O(VD). In 
particular, if g* and a+bVD do not have a common nonunit 
factor in O(VD) for any proper divisor k of n, then j = n, 
and so in this case h(O(VD)) is divisible by n. 

(e) Suppose that D < 0. Let p be the smallest prime factor of n, 
and set m = n/p. (If n is prime, then m = 1.) Suppose that 
|D| > q™ if D = 2 or 3 (mod 4), and |D| > 4g” if D=1 
(mod 4). Show that 7 = n in (d), so that in this case Im(z,) 
is isomorphic to Z, and hence, h(O(VD)) is divisible by n. 

(f) Give examples of D with O(VD) divisible by n for each 
n=2,..., 10. 

(g) Let gq =3 and let a=2-5-7-11=770. 

Let p > 11 bea prime and let b? be the largest perfect square 
dividing g?—a?. (Note that 311 > 7707.) Let D = (a?—q?)/b?. 
Show that Cl(O(VD)) has an element of order p, and hence 
that h(O)VD)) is divisible by p. 

Let F be an algebraic number field and let A = Of be the ring 

of algebraic integers of F. 

(a) Let I be a nonzero ideal of A. Show that [MZ is a nonzero 
ideal of Z. 

(b) Let P be a nonzero prime ideal of A. Show that PN Z is a 
nonzero prime ideal of Z. 

(c) Let P be a nonzero prime ideal of A. Show that P= < p,8 > 
for a unique prime p € Z and some element {3 € P. 

Let F be an algebraic number field that is an extension of Q of 

degree n, and let A = Of be the ring of algebraic integers of F. 

Let I be a nonzero ideal of A. Define the norm N(I) by 


N(I) = #(A4/D), 
the cardinality of the quotient A/T. 


(a) Let a € Z be a nonzero element and let J =< a > be the 
ideal of A generated by a. Show that N(I) = |a|”. 
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(b) Show that N(J) is finite for every nonzero ideal I of A. 

(c) Let J and J be nonzero ideals of A. Show that N(IJ) = 
N(I)N(J). 

Let F = Q(VD) be a quadratic extension of Q, and let A = Og 

be the ring of algebraic integers in F. Let p be a prime in Z and 

let [ =< p> be the ideal of A generated by J. Show there are 

exactly three possibilities: 

(i) I is a prime ideal of A and N(JI) = p?. 

(ii) I = J? for some prime ideal J of A and N(J) = p. 

(iii) J = J, Jg for some distinct prime ideals J; and Jz of A and 
N(1) = N(J2) = 

(Here N(J) denotes the norm of the ideal J.) 

The prime p is said to be inert, to ramify, or to split in cases (i), 

(ii), and (iii) respectively. 


(a) Let F = Q(V—1). Determine the behavior of p, as in the 
previous problem, for every prime p in Z. 
(b) Let F = Q(./—2). Determine the behavior of p, as in the 


previous problem, for p = 2, 3,5, 7. 

(c) Let F = Q(/—5). Determine the behavior of p, as in the 
previous problem, for p = 2,3, 5, 7. 

(d) Let F = Q(./—6). Determine the behavior of p, as in the 
previous problem, for p = 2,3, 5, 7. 

Let F be an algebraic number fald and let A = Of be the ring 
of algebraic integers of F. 


(a) Let I be a nonzero ideal of A, and let m = N(J). Show that 
m €I and hence <m>C TI. 

(b) Show that there are only finitely many nonzero ideals of A 
with any given norm. 

(c) We have the following theorem, which we shall not prove 
here. 


Theorem. Let F be an algebraic number field and let Op be the 
ring of algebraic integers of F. Then there is a constant N with 
the property that if I is any nonzero ideal of Og, there is an ideal 
J of Og, with [I] = [J] in the ideal class group Cl(Og), and with 
Nal) SN. 

Show that this theorem implies that Cl(Of) is a finite group. 
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In this exercise, we will use the fact that for any algebraic number 
field F, the ideal class group Cl(Of) is finite. In particular, if 
Cl(Op) has order n, then for any nonzero ideal I of Op, I¢ is a 
principal ideal for some d dividing n, and for every nonzero ideal 
I of Op, I” is a principal ideal. 

Let I be an ideal of Op and let E be a finite extension of F. 
We let I be the ideal of Og generated by I. 


(a) If J; and Ig are two ideals of Og, show that ‘ae = Tle. 

(b) Let I be a nonzero ideal of Og with [4 = <a>, a principal 
ideal of Of. Let E = F(a) where a is a root of the polynomial 
a? —a € F(x]. Let J =< a > be the principal ideal of Og 
generated by a. Show that [ = J. Such an element a of Og 
is called an ideal element of I. (Note that for any b € IJ, the 
quotient b/a is an algebraic integer.) 

(c) Show that there is an extension E of F of degree at most 
n such that for every ideal I of Og, the ideal T of Og is 
principal. (Note this does not imply, and it is not in general 
true, that every ideal of Og is principal.) 


Let F = Q(./—5). As we have seen, we have the following non- 
principal ideals of Op: 


I= (2,1+ V-5) = (14+ V—-5,1 -— /—5) = (2,1-V-5) =T 
I — (3,1+ V—5), Ip = (3,1 —/—5) with Tp =1,1; = Yo. 


(a) Show that a = v2 is an ideal element of J, that ay = 


V2—/-—5 is an ideal element of [;, and that ag = 
2+ /—5 is an ideal element of Io. 


(b) Verify that a, ay, and a2 are indeed algebraic integers. 

(c) Verify that 2/a, (1+./—5)/a, and (1—./—5)/a are algebraic 
integers, that 3/a, and (1+ /—5)/qa, are algebraic integers, 
and that 3/a2 and (1 — /—5)/az are algebraic integers. 
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Appendix A: Some Properties 
of the Integers 


In this appendix we simply list some properties of the integers, most, 
if not all, of which you are probably familiar with. We list these in 
order to be able to get started, as we will need them in our study of 
group theory, with which we begin. But we do not prove these here, 
as we will prove all of them in a more general context in our study 
of ring theory. 


Theorem A.1 (The division algorithm). Let a and b be integers 
with b #0. Then there are unique integers q and r such that 


a=bg¢+r with 0<r< |). 


Theorem A.2. Let a and b be integers, not both 0. Then there is a 
unique positive integer g such that: 


(1) g divides both a and b; and 
(2) ifd is any integer that divides both a and b, then d divides g. 


Furthermore, there are integers xp and yo (not unique) such that 
g = aro + byo. 


Definition A.3. The integer g in Theorem A.2 is the greatest 
common divisor of a and b, g = gcd(a, b). o) 


Definition A.4. Two integers a and b, not both 0, are relatively 
prime if gcd(a, b) = 1. © 
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Lemma A.5 (Euclid’s lemma). Let a, b, and c be nonzero inte- 
gers. Suppose that a divides bc. If a and b are relatively prime, then 
a divides c. 


Corollary A.6. Let a, b, and c be nonzero integers. Suppose that 
a divides c and b divides c. If a and b are relatively prime, then ab 
divides c. 


Corollary A.7. Let a, b, and c be nonzero integers. Suppose that 
a and b are relatively prime, and that a and c are relatively prime. 
Then a and be are relatively prime. 


Lemma A.8. Let a and b be integers, not both 0. Let d be a common 
divisor of a and b. Then gcd(a/d, b/d) = gcd(a,b)/d. In particular, if 
d is a positive integer, then a/d and b/d are relatively prime if and 
only if d = gcd(a,b). 


Definition A.9. Let n be a positive integer. 

If n = 1, then n is a unit. 

If nm > 1, and the only positive integers dividing n are 1 and n, 
then n is a prime. 

Otherwise, n is composite. © 


Theorem A.10 (Fundamental theorem of arithmetic). Let n 
be a positive integer. Then n can be written as a product of primes 
in a unique way up to order, 1.e., 


N=pip2... De for some primes pj, ..., Pk 
and if also 
n=qMqQ.--.-q for some primes q,---, % 


then k = 1, and after possible reordering, pj = q,1=1,..., k. 


Equivalently, 
C= DT 24 pe for some distinct primes pj, ..., Dr 
and positive integers e1, ..., ex 
and if also 
— qi bo qi! for some distinct primes qi, ..-, 


and positive integers fi, ..., fi 
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then k = 1, and after possible reordering, pj = q; and e; = fi, i = 
alae 


Remark A.7. In the fundamental theorem of arithmetic, we regard 
the integer 1 as having the empty factorization. © 
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Appendix B: A Theorem 
from Linear Algebra 


In this appendix we prove a theorem that we have used in our devel- 
opment of field theory. 


Theorem B.1. A vector space V over an infinite field F is not the 
union of finitely many proper subspaces. 


Proof. We first consider the case where V is finite dimensional. 
Let n = dim (V). We proceed by induction on n. 

If n = 0 there is nothing to prove (as a 0-dimensional vector space 
has no proper subspaces). 

If n = 1 the theorem is trivial (as the only proper subspace of a 
1-dimensional vector space is the subspace {0}). 

Let n = 2. Choose a basis B = {b;, bo} of V. For f € FU {oo}, let 


art = b, + fbg if feF 
af = bg if f=c 


and let Wy be the 1-dimensional subspace of V spanned by ar. 


It is easy to check that {W,} are all of the 1-dimensional subspaces 
of V, and they are all distinct (with Wr, 1 Wy, = {0} if fi A fo). 
Note there are infinitely many of these. 

Now let {Uj, ..., Ux} be any set of finitely many proper sub- 
spaces of V. We may assume that none of them is the zero subspace. 
Then U, = Wy,,..., Ux = Wy, in above notation, for some subset 
{fi, .--, fg} of F U{oo}. But now choose any fp in F U {00} that is 
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not in this subset. Then Wy, Wy, = {0},i = 1, ..., k, so in particu- 
lar af, € V with af, GWU... UWg, and hence W1U...UW, AV. 

Now let n > 3. Suppose the theorem is true for any (n — 1)- 
dimensional vector space and let V be n-dimensional. Choose a basis 
B = {bi,b2, ..., bn} of V. With az as above, let Wy be the (n — 
1)-dimensional vector space with basis {a f,b3, ..., bn}. There are 
infinitely many of these subspaces (though they are certainly not all 
of the (n — 1)-dimensional subspaces of V). 


Now let {U;, ..., U,} be any set of finitely many proper subspaces 
of V. Since there are infinitely many Wy, there is some fo € FU {oo} 
with U; A W,, for any i =1,..., k. (We are not assuming that each 
U; is (n — 1)-dimensional, as if U; has dimension < n — 1, certainly 
U; 4 Wy.) 

Let U} = Uj Wy. Since U; # Wy, Uj is a proper sub- 
space of Wy,, so {Uj,..., U,} is a finite set of proper subspaces 


of W,,, a vector space of dimension n — 1, so by the induc- 
tive hypothesis there is some element a € Wy, with a ¢ Uj 
U--. UU;. But then a ¢ U,U -+- UUx, 80 V AF UU «++ Ug. 
(If a € Uj, then, since a € Wy,, a€ U; A Wy, = Uj; impossible.) 

Then by induction, in the finite dimensional case, we are done. 

Now let V be arbitrary, and let {U;, ..., Ux} be any set of finitely 
many proper subspaces of V. Since each U; is a proper subspace of 
V, there is an element b; of V with b; ¢ U;, for each i =1,..., k. 

Let Vo be the subspace of V spanned by {bi, ..., bg}, and note 
that Vo is finite dimensional (indeed, dim Vo < k). Let Uj = U;N Vo 
for i = 1,..., k and note that each Uj is a proper subspace of Vo 
(as b; € Vo but b; ¢ U!). Then, by the finite dimensional case, there 
is some a € Vo with a ¢ Uj U--- UUs. But then (as above) a ¢ 
U,U--- UU, soV AU,U --+ UUs. 
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finite, 278 
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prime, 240, 242-243, 259, 382 
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rational, 376 
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Lagrange’s theorem, 49 
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M 
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natural numbers, 13 

Noether, 403 

Noether isomorphism theorems, 65, 
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first isomorphism theorem, 66, 247 
second isomorphism theorem, 66, 
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norm, 180 

Euclidean, 180 
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one-to-one correspondence, 13 


P 


partial fractions, 256 
permutation, 124 
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even/odd, 133 
inversions in, 132 
sign of, 133, 218 
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polynomial 
cyclotomic, 299-300, 310, 348 
monic, 173 
primitive, 234 
separable, 315 
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ramify, 406 
split, 406 
primitive root, 218, 299, 347 
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quadratic residue character, 98, 
218 
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218 


R 


regular 17-gon, 357 
relation(s), 6 
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ideal, 407 
idempotent, 249 
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unit, 154, 184, 249 
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ring(s), 149 
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image, 159 
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integrally closed, 382 
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local, 259 
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Noetherian, 163-164, 176, 194, 227, 
229, 382 
p-adic integers of, 262 
polynomial, 151, 172, 176, 236 
principally Noetherian, 200 
product of, 152 
quotient map, 168, 242 
simple, 249 
subring of, 157 
trivial, 150 
valuation, 260 
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Russell’s paradox, 24 
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Schroéder-Bernstein theorem, 18 
set theory 
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complement of, 2 
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disjoint, 6 
element of, 1 
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equal, 2 
intersection of, 4 
partition of, 12 
power, 17 
product of, 6 
proper subset of, 3 
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symmetric difference of, 25 
uncountable, 16 
union of, 4 
solvable by radicals, 351, 353-354 
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Sylow theorems, 113 
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theorem of the primitive element, 
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Vv 
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Wilson’s theorem, 96 
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Z-module, 376 
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